to cover the above fees.
IN THE UNITED STATES
PATENT AND TRADEMARK OFFICE 



Applicant : ZOU, Jitao et al.

Title : DIACYLGLYCEROL ACYLTRANSFERASE GENE FROM PLANTS

KIRBY EADES GALE BAKER,
Box 3432, Stn. D,
Ottawa, Ontario.
Canada K1P 6N9

The Hon. Commissioner of Patents 

And Trademarks, 
Washington DC 20231 U.S.A.

Dear Sir: 

PRELIMINARY AMENDMENT 

As a Preliminary Amendment, please amend this application as follows. 

IN THE DISCLOSURE 

Page 1, immediately following line 1, add: 

-- CROSS-REFERENCE TO RELATED APPLICATION
This application claims the priority rights of prior provisional U.S. Patent
Application Serial No. 60/112,812 filed December 17, 1998 (now
abandoned). --

REMARKS 



The purpose of this amendment is merely to add a cross-reference to a related 
application. 




Edwin Gale 
Reg No. 28,584 
Tel (613) 237-6900 
Our File No. 43922 
September 29, 2000 
File No.: 43922



IN THE UNITED STATES 
PATENT AND TRADEMARK OFFICE 



Applicant : ZOU, Jitao et al.

Title : DIACYLGLYCEROL ACYLTRANSFERASE GENE FROM PLANTS

ATTENTION : BOX SEQUENCE



The Hon. Commissioner of Patents 

And Trademarks, 
Washington DC 20231 U.S.A. 

Dear Sir: 



KIRBY EADES GALE BAKER, 
Box 3432, Stn. D, 
Ottawa, Ontario. 
Canada K1P 6N9



STATEMENT UNDER 37 CFR § 1.825

I hereby state that the content of the paper copy of the Sequence Listing in the 
specification of the present application is the same as the content of the Sequence Listing in 
Computer-readable form on the diskette filed concurrently herewith. 




Edwin Gale 
Reg No. 28,584 
Tel (613) 237-6900 
Our File No. 42748 
September 29, 2000 






Attorney's Docket No. 43922 



IN THE US PATENT AND TRADEMARK OFFICE 



Applicant: ZOU, Jitao; et al.
Serial no.: 09/623,514
Filed: October 3, 2000

Title: DIACYLGLYCEROL ACYLTRANSFERASE GENE FROM PLANTS

Examiner:

Art Unit:

ATTENTION: BOX SEQUENCE 



Kirby Eades Gale Baker 
Box 3432 Stn. D 
Ottawa, Ontario 
Canada K1P 6N9



The Hon. Commissioner of Patents & Trademarks 
Washington, DC 20231, USA 



Sir: 



Amendments to Sequence Listing and Computer Readable Copy thereof
under 37 CFR § 1.825



This is with reference to the Notification of Missing Requirements mailed on May 
14, 2001. Please amend this application as follows: 

IN THE DISCLOSURE 

Cancel the Sequence Listing (having its own page numbering scheme, pages 1 to 
20), and replace this with the new Sequence Listing provided herewith as new pages 1 
to 21. 



A marked-up copy of the original Sequence Listing showing the amendments 
made at this time. 






REMARKS 



The Notice of Missing parts contained a Notification to Comply with 
Requirements for Patent Applications Containing Nucleotide Sequences and/or Amino 
Acid Sequence Disclosures. 

A paper copy of an amended Sequence Listing and a copy in computer readable 
form on diskette are enclosed herewith. 

This amendment corrects the Sequence Listing. This submission includes no new
matter.

Also enclosed herewith is a statement of identity under 37 CFR § 1.821(f). 
In addition, a copy of the Notice to Comply is attached. 



Edwin J. Gale 

Reg. No. 28,584 

Tel. (613) 237-6900 

Fax (613) 237-0045 

E-Mail: egale@kirbyeades.com 



July 9, 2001 



IN THE US PATENT AND TRADEMARK OFFICE 



Applicant: ZOU, Jitao; et al. 
Serial no.: 09/623,514 
Filed: October 3, 2000 

Title: DIACYLGLYCEROL ACYLTRANSFERASE GENE FROM PLANTS 

Examiner: 

Art Unit: 

ATTENTION: BOX SEQUENCE 

Kirby Eades Gale Baker 
Box 3432 Stn. D 
Ottawa, Ontario 
Canada K1P 6N9

The Hon. Commissioner of Patents & Trademarks 
Washington, DC 20231, USA 



Sir: 



Statement under 37 CFR § 1.821(f)



This is with reference to the accompanying response of even date to the Notice to 
Comply mailed on May 14, 2001. 

The undersigned hereby states that the copy of the Sequence Listing in computer 
readable form on the attached diskette is the same as the substitute copy of the Sequence 
Listing enclosed in the aforesaid response. 



July 9, 2001 




Edwin J. Gale 

Reg. No. 28,584 

Tel. (613) 237-6900 

Fax (613) 237-0045 

E-Mail: egale@kirbyeades.com 
DIACYLGLYCEROL ACYLTRANSFERASE GENE FROM PLANTS



TECHNICAL FIELD 

This invention relates to plant genes useful for the genetic manipulation of
plant characteristics. More specifically, the invention relates to the identification,
isolation and introduction of diacylglycerol acyltransferase (DGAT) genes useful, for
example, for altering the seed oil content, the ratio of diacylglycerol / triacylglycerol
proportions in the seed oil, fatty acid synthesis, seed oil acyl composition, seed
size/weight and carbon flux into other seed components, in commercial or crop
plants.

BACKGROUND ART



Plant seed oils are major sources of essential polyunsaturated fatty acids for
human diets and renewable feedstocks for chemical industries. The enzymes of the
fatty acid synthase complex in the plastids of developing seeds are responsible for
the biosynthesis of fatty acids that are channeled into the cytosolic acyl-CoA pool to
sustain triacylglycerol accumulation. Triacylglycerol (TAG) biosynthesis is located in
the endoplasmic reticulum with glycerol-3-phosphate and fatty acyl-CoAs as the
primary substrates. There are three acyltransferases involved in the plant storage
lipid bioassembly, namely the glycerol-3-phosphate acyltransferase (GPAT, EC
2.3.1.15), the lyso-phosphatidic acid acyltransferase (LPAT, EC 2.3.1.51) and the
diacylglycerol acyltransferase (DGAT, EC 2.3.1.20). These three acyltransferases
catalyze the stepwise acylation of the glycerol backbone, with the final step being the
acylation of sn-1,2-diacylglycerol (DAG) by DGAT to form TAGs, a
biochemical process generally known as the Kennedy pathway (Stymne and
Stobart, 1987).

Among the three ER-based fatty acyl-CoA acyltransferases, only LPAT
gene(s) have been cloned from plants (Knutzon et al., 1995; Lassner et al., 1995).
Like several other enzymes involved in storage lipid biosynthesis, acyltransferases
are intrinsic ER proteins and are extremely difficult to purify. The research on plant
DGAT has been largely limited to studies of activity profiles by using the particulate
fractions generated by differential centrifugation of seed or microspore-derived
embryo homogenates (Weselake et al., 1993). Although partial purification of
DGAT from cotyledons of germinating soybean seeds was reported (Kwanyuan and
Wilson, 1986), detailed molecular characterization of this enzyme is lacking.

4-12-2000 PCT/CA99/01202

Reference is made to EMBL Database Entry AC005917; Accession
number AC005917; 4 November 1998; LIN X. ET AL.: "Arabidopsis thaliana
chromosome II section 113 of 255 of the complete sequence". This relates to a
nucleotide sequence deposit first submitted to the NCBI GenBank on November 3,
1998 without any identification of putative coding sequences contained therein.
There was no reference in the deposited materials to a "diacylglycerol O-
acyltransferase gene".

Reference is also made to VESNA KATAVIC ET AL.: "Alteration of Seed
Fatty Acid Composition by an Ethyl Methanesulfonate-Induced Mutation in
Arabidopsis thaliana Affecting Diacylglycerol Acyltransferase Activity"; PLANT
PHYSIOLOGY, vol. 108, 1995, pp. 398-409. This reference discloses an
Arabidopsis mutant designated AS11 that has reduced diacylglycerol
acyltransferase activity. The reference does not disclose any DNA sequences
whatsoever and teaches only that alterations in DGAT activity may lead to changes
in fatty acid content.

Accordingly, while the Kennedy pathway is known and shows the steps in
the biosynthesis of TAGs in plants, there has not been any identification and use of
a genetic element that can be used reliably in plants to modify TAG synthesis and
composition in a way that may be exploited commercially.

DISCLOSURE OF THE INVENTION

An object of the invention is to identify, isolate and clone a genetic element
that may be used to modify the natural formation of triacylglycerols in plants in order
to increase the yield of commercial plant oils, or to modify their composition to
achieve specific commercial improvements of plants and plant products.

Another object of the invention is to identify, isolate and characterize
diacylglycerol acyltransferase (DGAT) gene and cDNA sequences from Arabidopsis
and to utilize these sequences in the genetic manipulation of plants.
Another object of the invention is to provide a vector containing the full-length
DGAT cDNA sequence from Arabidopsis in a sense orientation under the
control of a seed-specific promoter (e.g. napin; see Josefsson et al., 1987; Radke
et al., 1988; Voelker et al., 1992), for reintroducing into Arabidopsis or for
introducing into other plants.

Another object of the invention is to provide a vector containing a genomic
fragment from Arabidopsis consisting of the full-length DGAT gene under the
control of its own 5' upstream regulatory sequences, for reintroducing into
Arabidopsis or for introducing into other plants.

Another object of the invention is to provide a method to construct a vector
containing the full-length DGAT sequence or a significant portion of the DGAT
sequence from Arabidopsis, in an antisense orientation under control of either a
constitutive or a seed-specific promoter, for reintroducing into Arabidopsis or for
introducing into other plants.
Another object of the invention is to provide a method of modifying
Arabidopsis and other plants to change their seed oil content.

Another object of the invention is to provide a method of modifying
Arabidopsis and other plants to change the acyl composition of their seed oil.

Another object of the invention is to provide a method of modifying
Arabidopsis and other plants to change their average seed weight or size.
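The antisense orientation named among the objects above means that the insert is placed so that its transcript is complementary to the native DGAT message. As a minimal sketch only, the Python function below computes the reverse complement of a DNA string, which is how an antisense insert reads relative to the sense strand. The 12-nucleotide fragment is hypothetical and is not taken from SEQ ID NO:1; the function name is likewise an illustrative assumption.

```python
# Complement table for the four unambiguous DNA bases (both cases).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA string.

    Characters other than A, C, G, T are passed through unchanged.
    """
    return dna.translate(_COMPLEMENT)[::-1]


# Hypothetical fragment for illustration; NOT part of SEQ ID NO:1.
sense_fragment = "ATGGCGATTTTG"
antisense_fragment = reverse_complement(sense_fragment)
print(antisense_fragment)
```

Applying the function twice returns the original string, which is a quick sanity check when preparing sense and antisense constructs in silico.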

According to one aspect of the present invention, there is provided a vector
containing isolated and purified deoxyribonucleic acid (cDNA) of SEQ ID NO:1
(pDGATcDNA; ATCC No PTA-989), for introduction of the cDNA in a sense
orientation into a plant cell.

As another aspect of the present invention, there is provided a vector 
containing isolated and purified genomic deoxyribonucleic acid (genomic DNA) of 
SEQ ID NO:3 (pDGATgene; ATCC No PTA-988), for introduction of the gene in a 
sense orientation into a plant cell. 

According to yet another object of the invention, there is provided a method
for preparing a vector containing SEQ ID NO:1 or a part thereof, for introduction of
the gene or partial gene in an antisense orientation, into a plant cell.

According to yet another object of the invention, there is provided seed of
Arabidopsis thaliana ecotype Columbia mutant AS11 (ATCC No. PTA-1013) and
characterization of its lipid phenotype (Katavic et al., 1995; Zou et al., 1999). The
AS11 mutant seed line has an insertion mutation at the TAG1 locus on
chromosome II, and produces plants exhibiting reduced DGAT activity (Figure 4)
and a reduced TAG/DAG ratio during seed development (Table 1), resulting in an
altered seed fatty acyl composition (Figure 2), reduced oil content (Table 1), and
increased seed oil diacylglycerol content during development (Figure 3) and at
maturity (lower TAG/DAG ratio, cf. Table 1). The cDNA sequence of the AS11 DGAT
is shown in SEQ ID NO:23, the genomic DNA sequence is shown in SEQ ID NO:24
and the translated protein sequence of the AS11 DGAT is shown in SEQ ID NO:25.
The invention also relates to transgenic plants and plant seeds having a
genome containing an introduced DNA sequence of SEQ ID NO:1 or SEQ ID NO:3,
and a method of producing such plants and plant seeds.

The invention also relates to SEQ ID NO:1 or SEQ ID NO:3, or a part of
SEQ ID NO:1 or SEQ ID NO:3, or SEQ ID NO:1 containing an 81 bp insertion [SEQ
ID NO:23] or SEQ ID NO:3 containing a 147 bp insertion [SEQ ID NO:24] such
that the deduced amino acid sequence of the encoded protein contains the
repeated sequence SHAGLFNLCVVVLIAVNSRLIIENLMK [SEQ ID NO:25] where
the spacing and identity of the underlined amino acids are identical or are replaced
by conserved substitutions, characterized in that said sequence has been
introduced in sense or antisense orientation, and a method of producing such plants
and plant seeds.

As will be appreciated by persons skilled in the art, the invention also relates
to substantially homologous DNA sequences from plants encoding proteins with
deduced amino acid sequences of 25% or greater identity, and 40% or greater
similarity, isolated and/or characterized and/or designed by known methods using
the sequence information of SEQ ID NO:1 or SEQ ID NO:3 or SEQ ID NO:1
containing an 81 bp insertion [SEQ ID NO:23] such that the deduced amino acid
sequence of the encoded protein contains the repeated sequence
SHAGLFNLCVVVLIAVNSRLIIENLMK (SEQ ID NO:25) where the spacing and
identity of the underlined amino acids are identical or are replaced by conserved
substitutions, and to parts of reduced length that are still able to function as
inhibitors of gene expression by use in an anti-sense, co-suppression (Transwitch;
Jorgensen and Napoli 1994) or other gene silencing technologies. It will be
appreciated by persons skilled in the art that small changes in the identities of
nucleotides in a specific gene sequence may result in reduced or enhanced
effectiveness of the genes and that in some applications (e.g. anti-sense or
co-suppression), partial sequences often work as effectively as full length versions.
The ways in which the gene sequence can be varied or shortened are well known to
persons skilled in the art, as are ways of testing the effectiveness of the altered
genes. All such variations of the genes are therefore claimed as part of the present
invention.
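The identity and similarity thresholds stated above (25% or greater identity, 40% or greater similarity) can be illustrated with a short Python sketch. It is only an illustration under stated assumptions: the two input strings are assumed to be already aligned to equal length with "-" marking gaps, percentages are taken over ungapped columns, and the groups of "similar" residues are a simplified stand-in chosen here, not the scoring used by any particular alignment program.

```python
# Simplified groups of chemically similar amino acids. This grouping is an
# assumption for illustration; alignment software typically relies on a
# substitution matrix instead.
CONSERVATIVE_GROUPS = [
    set("ILVM"), set("FYW"), set("KRH"), set("DE"),
    set("NQ"), set("ST"), set("AG"),
]


def identity_and_similarity(aligned_a: str, aligned_b: str, gap: str = "-"):
    """Return (% identity, % similarity) over the ungapped aligned columns."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = identical = similar = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap or b == gap:
            continue  # columns containing a gap are not compared
        compared += 1
        if a == b:
            identical += 1
            similar += 1  # identical residues also count as similar
        elif any(a in group and b in group for group in CONSERVATIVE_GROUPS):
            similar += 1
    if compared == 0:
        return 0.0, 0.0
    return 100.0 * identical / compared, 100.0 * similar / compared


def meets_thresholds(identity: float, similarity: float,
                     min_identity: float = 25.0,
                     min_similarity: float = 40.0) -> bool:
    """Check the two minimum percentages quoted in the text."""
    return identity >= min_identity and similarity >= min_similarity
```

Different programs normalise these percentages differently (for example over the shorter sequence or the full alignment length), so values from this sketch are not directly comparable with those reported by a given package.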
Other preferred degrees of identity to the indicated sequences are at least
30%, 40%, 50%, 60%, 70%, 80%, 90% and 95%; and other preferred degrees of
similarity are at least 50%, 60%, 70%, 80%, 90% and 95%. The inventors have
used a computer program known as MegAlign®, DNASTAR® of DNASTAR Inc.,
1228 South Park Street, Madison, WI 53715, USA, for assessing homology. This
program is based on the Clustal V algorithm (Higgins and Sharp (1988): A package
for performing multiple sequence alignment on a microcomputer; GENE 73:237-
244). For each gap introduced in the alignment, the program deducts a penalty
from the score. A higher gap penalty suppresses gapping; a lower value promotes
it. The program also assesses penalties based on the length of the gap. The more
residues the gap spans, the greater the penalty. The program deducts these
penalties from the overall score of the alignment.
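The two kinds of penalty described above (one for introducing a gap, one that grows with gap length) correspond to what is commonly called an affine gap model. The Python sketch below illustrates that idea only. It is not the MegAlign or Clustal V implementation, and the match score and penalty values are arbitrary assumptions chosen for the example.

```python
def gap_penalty_total(aligned: str, gap_open: float, gap_extend: float,
                      gap: str = "-") -> float:
    """Sum the penalties for every run of gap characters in one sequence.

    Each run costs gap_open once, plus gap_extend for every gapped position,
    so longer gaps are penalised more than shorter ones.
    """
    total = 0.0
    in_gap = False
    for ch in aligned:
        if ch == gap:
            if not in_gap:
                total += gap_open  # penalty for introducing the gap
                in_gap = True
            total += gap_extend    # penalty that scales with gap length
        else:
            in_gap = False
    return total


def alignment_score(aligned_a: str, aligned_b: str, match: float = 1.0,
                    mismatch: float = 0.0, gap_open: float = 10.0,
                    gap_extend: float = 1.0, gap: str = "-") -> float:
    """Score a pairwise alignment: residue scores minus all gap penalties."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    score = 0.0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue
        score += match if a == b else mismatch
    score -= gap_penalty_total(aligned_a, gap_open, gap_extend, gap)
    score -= gap_penalty_total(aligned_b, gap_open, gap_extend, gap)
    return score
```

With these assumed values, one gap of two residues costs 12 while two separate single-residue gaps cost 22, which shows why a high opening penalty suppresses gapping.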

Stated more generally, the present invention relates to the isolation,
purification and characterization of a diacylglycerol acyltransferase (DGAT) gene
from the Brassicaceae (specifically Arabidopsis thaliana) and demonstrates its utility
in regulating fatty acid synthesis, seed oil content, diacylglycerol/triacylglycerol
ratios and seed size/weight. Until now, no concrete data have been available on the gene
structure of plant DGATs, or their utility in altering oil content or composition through
genetic manipulation.

When considering altered oil contents or compositions, results from
averages of statistically-significant numbers of plants or seeds according to the
invention are best compared with results from averages of statistically-significant
numbers of untransformed (control) plants or seeds of the same genotype grown
under identical conditions at the same time. This allows for the variability of
individual plants of the same genotype, particularly when such plants are grown
under different conditions. The actual number of plants or seeds used to form the
required average may vary, but should be enough to provide a generally constant
average whenever such number is selected. Generally, the number should be at
least 10, and is more preferably at least 20, 30, 50 or 100.
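The comparison of averages described above can be sketched in Python as follows. This is a minimal illustration, assuming each sample is a list of per-plant (or per-seed-batch) measurements; the minimum of 10 comes from the text, while the function names and any numbers passed in are hypothetical and are not data from this specification.

```python
from math import sqrt
from statistics import mean, stdev

MIN_SAMPLES = 10  # the text suggests at least 10 plants or seeds per average


def summarize(values):
    """Return (mean, standard error) for a sample of at least MIN_SAMPLES."""
    if len(values) < MIN_SAMPLES:
        raise ValueError(f"need at least {MIN_SAMPLES} measurements")
    return mean(values), stdev(values) / sqrt(len(values))


def relative_change(transformed, control):
    """Percent change of the transformed mean relative to the control mean.

    Both samples are assumed to come from plants of the same genotype grown
    under identical conditions at the same time.
    """
    transformed_mean, _ = summarize(transformed)
    control_mean, _ = summarize(control)
    return 100.0 * (transformed_mean - control_mean) / control_mean
```

A call such as `relative_change(transgenic_oil, control_oil)` would then report the percentage difference in mean oil content between two lines, with `summarize` supplying the standard error for error bars.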

The DGAT gene was cloned, characterized and authenticated from
Arabidopsis by: (1) selection and characterization of plant ESTs sharing some
homology to mammalian acyl-CoA: cholesterol acyltransferases; (2) the functional
expression of a full-length cDNA in yeast; (3) the characterization and isolation of the
DGAT (TAG1) gene from Arabidopsis mutant AS11 containing an insertion mutation in the
DGAT gene and a seed oil phenotype which consists of an altered DAG/TAG ratio, and an
altered oil content and acyl composition; (4) complementation of the AS11 mutant by
insertion of the DGAT cDNA sequence to restore a wild-type fatty acid composition; (5) the
over-expression of the DGAT cDNA in wild-type A. thaliana transgenic plants which
produce seeds with an increased oil content, increased average seed weight and altered
seed oil acyl composition.

The A. thaliana DGAT structure is significantly homologous (over 40% identical
over a region of more than 400 amino acids) to its mammalian counterparts, and is highly
homologous to subsequently reported putative B. napus DGATs at both the nucleotide
(92%) and the deduced amino acid (90%) levels (Nykiforuk et al., 1999; GenBank/EMBL
Accession No. AF155224; AF164434).

The DGAT of the current invention is useful in manipulating DGAT activity, and
triacylglycerol bioassembly in plants. For example, by transforming plants with a construct
containing the DGAT gene in a sense orientation, under the control of a tissue-specific
promoter (e.g. seed-specific promoter napin), the expression of DGAT and accumulation of
seed oil can be enhanced or the acyl composition of the seed oil altered. Yet another
example would be to express the DGAT cDNA under the control of a constitutive promoter
(e.g. 35S (Datla et al., 1993)) to increase the TAG content of vegetative tissues (leaves,
roots, stems). This may have particular advantages for altering the starch/oil ratio in root
crops.

Alternatively, DGAT expression can be silenced to some degree by anti-sense or
co-suppression (Transwitch) phenomena (De Lange et al., 1995; Mol et al., 1990;
Jorgensen and Napoli, 1994; Kinney, 1996; Vaucheret et al., 1998; Taylor, 1996). For
example, silencing DGAT in a seed-specific manner may result in a reduction in TAG
accumulation. This could have applications in reducing the oil content in seed barley to
enhance stability during storage. As a second example, seed-specific silencing may lead
to a relatively high accumulation of DAG or an increase in the DAG/TAG ratio in the
developing or mature seed. As yet another example, the expression of the mutated DGAT
gene which results in a 27 amino acid repeat insertion in the mutant DGAT protein
(See Figure 5a) can be used to
alter the DAG/TAG ratio in developing and mature seed. Such manipulations can
lead to edible seed oils produced naturally in the plant, containing enhanced relative
levels of DAG / reduced levels of TAG (See Figure 3; Table 1) to act as all-natural
emulsifiers in the food and confections industries, or to enhance the
nutritional/health profile of vegetable oils as functional foods (e.g. as cooking oils,
stir fry oils, in salad dressings, margarines etc.) by inhibiting neutral fat deposition
in humans. Processed oils produced from canola and soybean which contain
increased proportions of diacylglycerol have been cited by the Kao Corporation of
Japan (e.g. Econa Cooking Oil; Kao Corporation Kl, 1-14-10 Nihonbashi-
Kayabacho, Chuoku, Tokyo 103 Japan; e-mail: 210064@kastanetkao.co.jp) as a
product making it difficult for blood neutral fat to increase after a meal, and for fat to
cling to the body, thereby assisting individuals who are overweight or who suffer
from high neutral fat levels. As a third example, silencing or reducing the activity of
DGAT in a seed-specific manner (as observed in mutant AS11; e.g. by over-expressing
SEQ ID NO:23 or silencing expression of SEQ ID NO:1 or SEQ ID
NO:3), and combining this trait with the capacity to produce polyhydroxyalkanoates
(PHAs; e.g. polyhydroxybutyrate) in seeds (Poirier et al., 1992; 1995) will allow an
increased flow of unesterified fatty acids towards β-oxidation (Poirier et al., 1999).
By recycling or diverting the unesterified fatty acids into β-oxidation, the resulting
acetyl-CoA moieties will lead to a significant increase in polyhydroxyalkanoates
(PHAs) or a change in PHA composition (Poirier et al., 1999). Transgenic plants
producing PHAs in seeds have potential for utility as biodegradable thermoplastics.
However, up to now, the levels of PHAs produced have been relatively small
(Poirier et al., 1992; 1995). The utility of transgenically reducing the DGAT activity to
significantly enhance PHA production (e.g. 10-fold increase) in PHA-producing
seeds is now possible, due to the current DGAT invention.

Some of the manipulations and deliverables which are possible using the
DGAT gene or a part thereof, include, but are not limited to, the following: seeds with
increased or decreased oil content; seeds containing oils with an enhanced
diacylglycerol content, seed oils with an altered acyl composition; plants producing
larger or heavier seeds; plants exhibiting an enhanced or altered capacity to
accumulate storage compounds in other storage organs (e.g. tubers, roots).
BRIEF DESCRIPTION OF THE DRAWINGS

Figure 1 is a diagram illustrating the Kennedy pathway for the bioassembly
of triacylglycerols in plants, and shows the critical role played by DGAT as the final
step of the Kennedy pathway.

Figure 2 is a graph showing the comparison of the fatty acid composition of
seed oil from wild-type (WT) and DGAT mutant AS11 lines of Arabidopsis thaliana.
Proportions of fatty acids are reported as Mol % of the total fatty acid composition of
the seed oil from each line. Error bars are ± SE (n = 10 plants of each line sampled,
with 50 seeds per sample per analysis).

Figure 3 is a graph showing a comparison of the DAG content in developing
seed of wild-type (WT) and DGAT mutant AS11 lines of Arabidopsis thaliana. More
specifically, it is a comparison of the fatty acid content of DAG pool in wild-type
green developing seeds compared to that of the AS11 mutant.

Figure 4 is a graph showing a comparison of the DGAT activity
(pmol/min/mg protein) in developing seeds at the milky, early green, mid-green and
brown-green stages of embryo development in wild-type (WT) and DGAT mutant
AS11 lines of Arabidopsis thaliana. Developing seeds at each stage were selected
and DGAT enzyme analyses conducted as described previously by Katavic et al.
(1995).
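The unit used for DGAT activity in Figure 4 (pmol/min/mg protein) is a specific activity: product formed, divided by incubation time and by the amount of protein assayed. A minimal Python sketch of that arithmetic follows; the function name and the example inputs are hypothetical and do not come from the assays reported here.

```python
def specific_activity(product_pmol: float, minutes: float,
                      protein_mg: float) -> float:
    """Return enzyme specific activity in pmol product / min / mg protein."""
    if minutes <= 0 or protein_mg <= 0:
        raise ValueError("incubation time and protein amount must be positive")
    return product_pmol / (minutes * protein_mg)


# Hypothetical example: 300 pmol of product formed in 10 min by 0.5 mg protein.
print(specific_activity(300.0, 10.0, 0.5))
```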

Figure 5(a) shows the amino acid sequence alignment of the Arabidopsis
DGAT (AtTAG1) [SEQ ID NO:2] with mammalian (mouse and putative human)
DGATs. MDGAT, mouse DGAT [SEQ ID NO:4; GenBank/EMBL Accession No.
AF078752 (Cases et al., 1998)]; HARGP1, human ARGP1 protein [SEQ ID NO:5;
GenBank/EMBL Accession No. AF059202; Oelkers et al., 1998]. Dots indicate
gaps. Identical amino acid residues are highlighted in black. Conserved residues
are shaded. The 27-amino acid repeat found in Arabidopsis thaliana mutant AS11
and generated by the insertion mutation (81 bp) found in SEQ ID NO:23
(SHAGLFNLCVVVLIAVNSRLIIENLMK) is overlined thus: . The putative
diacylglycerol binding site is overlined thus: . The SnRK1 targeting site is
overlined thus: ===s== with an asterisk (*) over the serine (S) residue as the
targeted phosphorylation site.
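The relationship between the 81 bp insertion and the 27-amino acid repeat follows from the triplet genetic code: an insertion whose length is a multiple of three adds whole codons and keeps the downstream reading frame. The short Python sketch below makes that arithmetic explicit; the function name is an illustrative assumption.

```python
def inserted_residues(insertion_bp: int) -> int:
    """Number of amino acids added by an in-frame coding-sequence insertion.

    Raises ValueError if the insertion length is not a multiple of three,
    since such an insertion would shift the reading frame instead.
    """
    if insertion_bp <= 0:
        raise ValueError("insertion length must be positive")
    if insertion_bp % 3 != 0:
        raise ValueError("insertion is not in frame (length not divisible by 3)")
    return insertion_bp // 3


print(inserted_residues(81))
```

The check applies to insertions within the coding sequence of a cDNA; a genomic insertion may also include non-coding (intron) sequence, so its length need not translate directly into residues.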
Figure 5(b) shows the Kyte-Doolittle hydropathy plot of the DGAT protein.
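A Kyte-Doolittle hydropathy plot is obtained by averaging per-residue hydropathy values over a sliding window along the protein. The Python sketch below shows the calculation using the published Kyte-Doolittle scale. The window length of 9 is an assumption made for illustration, since the window used for Figure 5(b) is not stated here, and the function name is hypothetical.

```python
# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(protein: str, window: int = 9):
    """Return the mean hydropathy of every full window along the sequence.

    The i-th value covers residues i .. i + window - 1 (0-based). Positive
    values indicate hydrophobic stretches, such as candidate
    membrane-spanning regions.
    """
    protein = protein.upper()
    if window < 1 or window > len(protein):
        raise ValueError("window must be between 1 and the sequence length")
    scores = [KYTE_DOOLITTLE[aa] for aa in protein]  # KeyError on non-standard residues
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]
```

Plotting the returned list against residue position gives the familiar hydropathy trace; longer windows smooth the curve and are often preferred when looking for transmembrane segments.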

Figure 6(a) shows the results of a Northern analysis of TAG1 gene
expression in Arabidopsis thaliana. Total RNA was extracted from roots (RT),
leaves (LF), flowers (FL), young seedlings (YS), developing siliques (SL), and
germinating seeds (GS).

Figure 6(b) shows the results of Southern blot analysis of the TAG1 gene in
Arabidopsis thaliana. Genomic DNA was digested with restriction enzymes BglII
(Lane 1), EcoRI (Lane 2), and HindIII (Lane 3). The TAG1 DNA probe was 32P-
labeled by random priming.

Figure 7(a) is a diagrammatic representation of the TAG1 gene structure.
The boxes indicate the 16 exons (solid boxes for coding regions, open box for
untranslated regions), and the lines represent the 15 introns. A, B and C denote the
positions of the primers used for PCR amplifications of the segments from wild type
(WT) and AS11. The specific primers A, B and C are described in Experimental
Procedures: Primer Strategy (found later in this specification).
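A gene with 16 exons necessarily has 15 introns, one between each pair of consecutive exons. As a small illustrative sketch, the Python function below derives intron coordinates from a list of exon coordinates. The coordinate convention (1-based, inclusive) and the example positions are assumptions and do not reproduce the actual TAG1 gene map.

```python
def introns_from_exons(exons):
    """Return intron (start, end) pairs lying between consecutive exons.

    `exons` is a list of (start, end) tuples, 1-based and inclusive, sorted
    by position and non-overlapping. n exons yield n - 1 introns.
    """
    introns = []
    for (_, end_prev), (start_next, _) in zip(exons, exons[1:]):
        if start_next <= end_prev + 1:
            raise ValueError("consecutive exons must be separated by at least one base")
        introns.append((end_prev + 1, start_next - 1))
    return introns


# Hypothetical coordinates for a 16-exon gene (NOT the real TAG1 positions).
example_exons = [(i * 100 + 1, i * 100 + 50) for i in range(16)]
print(len(introns_from_exons(example_exons)))
```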

Fig 7(b) shows gel separation of the PCR products amplified from wild type
(WT) and AS11. Lane 1, PCR product with primers A and B using WT genomic
DNA as template. Lane 2, PCR product with primers A and B using AS11 genomic
DNA as template. Lane 3, PCR product with primers C and B using WT genomic
DNA as template. Lane 4, PCR product with primers C and B using AS11 genomic
DNA as template. Lane 5, RT-PCR with primers A and B using RNA prepared from
WT seedling RNA. Lane 6, RT-PCR with primers A and B using RNA prepared
from AS11 seedling RNA.

Figure 8 is a graph showing microsomal DGAT activity in yeast strain YMN5
transformed with empty plasmid (pYES Con) and with the A. thaliana DGAT cDNA
(pYES:DGAT). This illustrates the expression of the TAG1 cDNA in yeast. Host
cultures of strain YMN5 were transformed with pYES2 plasmid only (pYES2; without
TAG1 insert) or with pYES2 containing the TAG1 cDNA insert (pYES2:TAG1).
Following induction in the presence of galactose, transformants were lysed and
assayed for DGAT activity as described in the Experimental Procedures. The
results of two separate DGAT activity experiments are shown.
Figure 9 is a map of plasmid pSE129A which may be used as a vector. The
vector contains the following salient features for plant transformation in the current
invention: the seed-specific napin promoter and NOS terminator between which is a
multiple cloning site.

Figure 10 is a graph showing the complementation of the AS11 DGAT
mutation with the wild-type cDNA. Transformation of Arabidopsis thaliana mutant
line AS11 with the DGAT cDNA [SEQ ID NO:1] under the control of a napin
promoter leads to a restoration of the wild-type (WT) fatty acid composition in the
seed oil of the transformant lines 3-4, 9-1, 14-2 and 9-4. Fatty acid composition (wt
%) was determined on the seed oil extracted from 100-seed samples from A.
thaliana non-transformed controls (WT), mutant line AS11, and T2 seeds of
napin:DGAT transgenic lines.

Figure 11 is a graph showing the seed oil content of non-transformed WT
control, and pRD400 control (empty plasmid) and napin:DGAT T2 transgenic
Arabidopsis thaliana seed lines. More particularly, the graph shows that the
transformation of wild type (WT) Arabidopsis thaliana with the DGAT cDNA [SEQ ID
NO:1] under the control of a napin promoter leads to a higher seed oil content in
the DGAT transgenic lines. Oil content is expressed as µg total fatty acids (FAs) per
100 seeds from A. thaliana non-transformed controls (WT Con), and T2 seeds of
pRD400 control (empty plasmid) transgenic, and napin:DGAT transgenic lines 1', 2',
9, 10 and 11. Standard error bars for the control lines are indicated; n=10.

Figure 12 is a graph showing the average 100-seed weight of non-transformed
WT control, and pRD400 control (empty plasmid) and napin:DGAT T2
transgenic Arabidopsis thaliana seed lines. More specifically, the graph shows that the
over-expression of the DGAT cDNA under the control of a napin promoter in wild-type
(WT) Arabidopsis thaliana leads to a higher average seed weight. The average
weights of 100-seed samples from A. thaliana non-transformed controls (WT Con),
and T2 seeds of pRD400 control (empty plasmid) transgenic, and napin:DGAT
transgenic lines 1', 2', 9, 10 and 11 are reported as mg dry weight (DW).

Figure 13 is a graph showing the positive correlation between oil content
(expressed as µg total fatty acids (FAs) per 100 seeds) and average seed weight
(expressed as average mg DW per 100-seed samples) from A. thaliana non-transformed
controls (WT Con), T2 seeds of pRD400 control (empty
plasmid) transgenic, and napin:DGAT transgenic lines 1', 2', 9, 10 and 11 (■).
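A correlation such as the one plotted in Figure 13 is commonly summarised with the Pearson correlation coefficient. The Python sketch below computes it from paired measurements, for example oil content and 100-seed weight for each seed line. It is an illustration only: the statistic actually used for Figure 13 is not stated here, and any values supplied to the function are the user's own, not data from this specification.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Return the Pearson correlation coefficient of two paired samples."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two samples of equal length with at least 2 points")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when a sample has no variance")
    return sxy / sqrt(sxx * syy)
```

A value near +1 indicates that lines with higher oil content also tend to have heavier seeds, which is the relationship the figure is described as showing.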

BEST MODES FOR CARRYING OUT THE INVENTION 

Fig. 1 is a diagram illustrating the Kennedy pathway for the biosynthesis of
TAGs in plants. Of the various illustrated enzymes, DGAT is the only enzyme in
the Kennedy pathway that is exclusively committed to TAG biosynthesis, and its key
role is apparent from the scheme of Figure 1. sn-1,2-DAG, generated as a result of
either the catalytic action of PA phosphatase (EC 3.1.3.4) or CPTase (EC 2.7.8.2),
can be used in the biosynthesis of TAG.

For this reason, the inventors of the present invention decided to investigate 
DGAT to see if the corresponding gene in plants could be sequenced and cloned 
and used to modify the seed oil content and fatty acid composition of plants in a 
way that could be commercially useful. 

The acyl-CoA dependent acylation of sn-1,2-DAG is catalyzed by DGAT
(Stymne and Stobart, 1987). In developing and germinating seeds of oilseed
plants, TAG accumulation and DGAT activity have been shown to associate with the
endoplasmic reticulum (ER; high speed microsomal fraction) (Stobart et al., 1986;
Cao and Huang, 1986; Stymne and Stobart, 1987; Frentzen, 1993; Settlage et al.,
1995; Lacey and Hills, 1996). The biochemical properties of microsomal DGAT have
been examined in a number of plant systems (Frentzen, 1993) including developing
seeds (Bernerth and Frentzen, 1990; Vogel and Browse, 1996; Cao and Huang,
1987) and embryo cultures (Taylor et al., 1991; Weselake et al., 1991; Taylor et al.,
1992; Little et al., 1994) of B. napus L. In general, studies with developing seeds
indicate that DGAT activity increases rapidly during the active phase of oil
accumulation and then decreases markedly as seed lipid content reaches a plateau
(Tzen et al., 1993; Weselake et al., 1993).

A number of studies with both mammalian (Mayorek et al., 1989; Tijburg et
al., 1989) and plant (Ichihara et al., 1988; Perry and Harwood, 1993a and 1993b;
Settlage et al., 1995) systems have suggested that DGAT may catalyze a
rate-limiting reaction in TAG bioassembly. However, this hypothesis has not been
rigorously tested, and has not been reduced to practice by transgenic expression of
any DGAT gene in plant or animal systems, until now. Developing seeds of
B. napus L. cv Shiralee have been shown to produce significant levels of DAG
during the active phase of oil accumulation, suggesting that the DGAT-catalyzed
reaction may regulate the flow of carbon into TAG (Perry and Harwood, 1993a and
1993b). In addition, an ethyl methanesulfonate-induced (EMS-induced) mutant of
A. thaliana, designated AS11, has been shown to have a reduced DGAT activity
that correlated with both an increased DAG pool and decreased accumulation of
TAG (Katavic et al., 1995). Given its possible rate-limiting role in TAG bioassembly,
the inventors of the present invention have identified DGAT as a potential target in
the genetic modification of plant lipid biosynthesis.

Previously, the partial characterization of an EMS-induced Arabidopsis
thaliana mutant, AS11, with altered fatty acid composition was reported (Katavic et
al., 1995). In comparison to wild type plant seeds, AS11 seeds have reduced levels
of the very long chain fatty acid eicosenoic acid (20:1) and reduced oleic acid (18:1)
and accumulate α-linolenic acid (18:3) as the major fatty acid in triacylglycerols
(Figure 2). The AS11 mutant has a consistently lower ratio of TAG/DAG in
developing seeds, and it accumulates an elevated amount of seed DAG (Figure 3),
the substrate of the diacylglycerol acyltransferase. Through a series of biochemical
analyses, it was shown that AS11 had reduced diacylglycerol acyltransferase
activities throughout seed development (Figure 4). AS11 also had a reduced (by
25-30%) oil content phenotype, providing some evidence that DGAT may be
controlling flux into TAG biosynthesis, as shown in Table 1 below. AS11 did not
have a wrinkled-seed phenotype as described in other low-seed-oil mutants (Focks
and Benning, 1998).






WO 00/36114 PCT/CA99/01202 


Table 1

Comparison of AS11 [Katavic et al. (1995)] and wild-type A. thaliana seeds
with respect to lipid profiles at mid-development, and the relative TAG, DAG and
sterol ester contents in AS11 and WT seeds at maturity.

Seed Type   TAG/DAG ratio     TAG/DAG ratio   Relative TAG content    Sterol Esters at
            at mid-           at maturity     at maturity (b)         maturity (% of Total
            development (a)   (a)             [nmol/mg DW (c)]        Lipid Extract) (d)

WT          17                90              1.00 (b) [255 (c)]      0.8
AS11        5                 20              0.6 (b) [174 (c)]       1.15

(a) Embryos staged and lipids measured as described in Katavic et al. (1995);
(b) Relative TAG content of 200-seed samples of AS11 and WT was measured by
MASS-1H-NMR according to the method of Rutar (1989). The integration responses for
resonances attributable to liquid-like oil were summed and the value for AS11 seed is
reported relative to the response for the WT control seed sample (the latter set at a
value of 1.00);
(c) TAG content (nmoles/mg DW) measured by transmethylation of a TLC-purified TAG
fraction, followed by GC analysis of fatty acid methyl esters;
(d) A total lipid extract was prepared as described by Taylor et al. (1991; 1992), and
sterol esters isolated and characterized as described in the Experimental Procedures.

Genetic analysis indicated that the fatty acid phenotype is caused by a 
semidominant mutation in a nuclear gene, designated TAG1. The mutation was 
mapped to chromosome II, and was estimated to lie in the region approximately 
17.5 ± 3 cM from the sti locus and 8 ± 2 cM from the cp2 locus.

Because a DGAT gene has not heretofore been cloned from any plant, until
now, it has not been possible to address the possibility of genetic modifications to
alter carbon flux, increase fatty acid synthesis, oil content, oil acyl composition, or
seed size, by modulating plant DGAT activity.

However, there are many examples of successful modifications to plant
metabolism that have been achieved by genetic engineering to transfer new genes
or to alter the expression of existing genes, in plants. It is now routinely possible to
introduce genes into many plant species of agronomic significance to improve crop
performance (e.g. seed oil or tuber starch content/composition; meal improvement;
herbicide, disease or insect resistance; heavy metal tolerance etc.) (MacKenzie and
Jain, 1997; Budziszewski et al., 1996; Somerville, 1993; Kishore and Somerville,
1993).

For example, increases in the proportions of some strategic fatty acids and
in the quantities of seed oil have been achieved by the introduction of various fatty
acid biosynthesis and acyltransferase genes in oilseed crops. These include the
following demonstrations: Expression of an anti-sense construct to the
stearoyl-ACP Δ9 desaturase in Brassicaceae led to an increase in the stearic acid
content (Knutzon et al., 1992). Expression of a medium chain fatty acyl-ACP
thioesterase from California Bay in Brassicaceae was demonstrated to increase the
lauric acid (12:0) content (Voelker et al., 1992; 1996). Expression of a Jojoba
β-keto-acyl-CoA synthase in low erucic acid Brassicaceae led to an increase in the
level of erucic acid (22:1); the effect following expression in high erucic acid
cultivars was negligible (Lassner et al., 1996). Increased proportions of oleic acid in
Brassica napus and in soybean have been achieved by silencing the microsomal
FAD2 (Δ12) desaturase (Hitz et al., 1995; Kinney, 1995; 1997). Transformation of
Arabidopsis thaliana and rapeseed (B. napus) with a yeast sn-2 acyltransferase
resulted in seed oils with increased proportions of 22:1 and other very long-chain
fatty acids and significant increases in seed oil content (Zou et al., 1997).

Starch deposition has also been altered by genetic engineering. By
expression of a mutant E. coli glgC16 gene encoding an ADP glucose
pyrophosphorylase in potato tubers, an increase in starch accumulation was
achieved (Stark et al., 1992).

The inventors therefore considered the DGAT gene to hold great promise for
the desired modification of TAGs in plants.

The best modes for carrying out the invention will be apparent from the 
following description of the results of tests and experiments that have been carried 
out by the inventors. 

The inventors chose to use the well-accepted model plant system
Arabidopsis thaliana for the cloning of DGAT, as a host system for genetic
engineering to alter DGAT expression, and to study the effects of altering DGAT
expression on seed triacylglycerol bioassembly. This is because, over the past
several years, Arabidopsis thaliana, a typical flowering plant, has gained increasing
popularity as a model system for the study of plant biology. As a result of the ease
with which this plant lends itself to work in both classical and molecular genetics,
Arabidopsis has come to be widely used as a model organism in plant molecular
genetics, development, physiology and biochemistry (Meyerowitz and Chang, 1985;
Meyerowitz, 1987; Goodman et al., 1995). This model dicotyledonous plant is also
closely related to Brassica crop species and it is increasingly apparent that
information concerning the genetic control of basic biological processes in
Arabidopsis will be transferable to other species (Lagercrantz et al., 1996).

Indeed, there are numerous examples wherein studies of the molecular
biology and biochemistry of a particular metabolic pathway or developmental
process, and the possibility of genetically engineering a plant to bring about changes
to said metabolic pathway or process, have first been tested in the model plant
Arabidopsis, and then shown to yield similar phenotypes in other plants, particularly
crop plants.

For example, the extra-plastidial membrane associated oleate (18:1) Δ12
(ω-6) desaturase gene, FAD2, was originally studied and eventually cloned from
Arabidopsis thaliana, by identifying the lesion found in an A. thaliana mutant
defective in desaturating oleate to produce linoleate (18:2) on the
phosphatidylcholine backbone. This resulted in a high oleic acid phenotype in the A.
thaliana seed oil (Okuley et al., 1994). Genetic engineering of both soybean
(Glycine max.) and canola B. napus to silence the indigenous FAD2 gene(s) in a
seed-specific manner by anti-sense or co-suppression approaches resulted in
similar high oleic acid seed oil phenotypes (Kinney, 1995; 1997).

Transgenic expression of a yeast sn-2 acyltransferase (SLC1-1) gene to
achieve enhanced seed oil and very long-chain fatty acid content was first
performed in Arabidopsis and later shown to yield similar phenotypes in transgenic
rapeseed (B. napus) experiments (Zou et al., 1997). Arabidopsis thaliana has
repeatedly shown itself to be a useful model system for metabolic engineering of
metabolic pathways (e.g. lipid biosynthesis, photosynthesis) or processes
(organogenesis, reproductive development etc.) common to all higher plants.

In the area of secondary metabolism/signal transduction, an anthocyanin
pathway-specific transcriptional activator from the monocot maize designated as R
(the myc transcription factor involved in activation of biosynthetic genes for
anthocyanin production in the aleurone cells of maize kernels), was expressed in
the dicot Arabidopsis, causing augmented anthocyanin pigmentation in the
inflorescences. Subsequent expression in another dicot, tobacco (Nicotiana
tabacum), resulted in similar floral pigmentation changes (Lloyd et al., 1992). These
experiments demonstrate that whole pathways common to all flowering plants can
be co-ordinately controlled through the introduction of transcriptional regulators, and
that the mechanisms are common to diverse plant species.

In the context of the current invention, all plant seeds accumulate some
triacylglycerol (oil) and this ubiquitous process is affected, at least in part, by the
activity of a microsomal DGAT, as explained previously. Thus, many of the effects
observed following genetic engineering to modulate DGAT expression in
Arabidopsis can be expected and predicted to result in similar phenotypes when
carried out in all other plants. For example, after the present invention was made,
information has become available that supports the findings of the present inventors
by showing that B. napus has a highly homologous DGAT gene (Nikiforuk et al.,
1999), and thus B. napus is a clear target for similar genetic modifications as those
shown for A. thaliana.

There are a number of ways by which genes and gene constructs can be
introduced into plants, and a combination of plant transformation and tissue culture
techniques have been successfully integrated into effective strategies for creating
transgenic crop plants. These methods, which can be used in the present
invention, have been extensively reviewed elsewhere (Potrykus, 1991; Vasil, 1994;
Walden and Wingender, 1995; Songstad et al., 1995), and are well known to
persons skilled in the art. For example, one skilled in the art will certainly be aware
that, in addition to Agrobacterium-mediated transformation of Arabidopsis by
vacuum infiltration (Bechtold et al., 1993) or wound inoculation (Katavic et al.,
1994), it is equally possible to transform other plant and crop species, using
Agrobacterium Ti-plasmid-mediated transformation (e.g. hypocotyl (DeBlock et al.,
1989) or cotyledonary petiole (Moloney et al., 1989) wound infection), particle
bombardment/biolistic methods (Sanford et al., 1987; Nehra et al., 1994; Becker et
al., 1994) or polyethylene glycol-assisted protoplast transformation (Rhodes et al.,
1988; Shimamoto et al., 1989) methods.

As will also be apparent to persons skilled in the art, and as extensively
reviewed elsewhere (Meyer, 1995; Datla et al., 1997), it is possible to utilize plant
promoters to direct any intended up- or down-regulation of transgene expression
using constitutive promoters (e.g. those based on CaMV35S), or by using
promoters which can target gene expression to particular cells, tissues (e.g. napin
promoter for expression of transgenes in developing seed cotyledons), organs (e.g.
roots), to a particular developmental stage, or in response to a particular external
stimulus (e.g. heat shock).

Particularly preferred plants for modification according to the present
invention include Arabidopsis thaliana, borage (Borago spp.), Canola, castor
(Ricinus communis), cocoa bean (Theobroma cacao), corn (Zea mays), cotton
(Gossypium spp.), Crambe spp., Cuphea spp., flax (Linum spp.), Lesquerella and
Limnanthes spp., Linola, nasturtium (Tropaeolum spp.), Oenothera spp., olive (Olea
spp.), palm (Elaeis spp.), peanut (Arachis spp.), rapeseed, safflower (Carthamus
spp.), soybean (Glycine and Soja spp.), sunflower (Helianthus spp.), tobacco
(Nicotiana spp.), Vernonia spp., wheat (Triticum spp.), barley (Hordeum spp.), rice
(Oryza spp.), oat (Avena spp.), sorghum (Sorghum spp.), rye (Secale spp.) or other
members of the Gramineae.

The present invention is particularly useful when used to modify the yield or
composition of oilseed produced from oilseed crops. Oilseed crops are plant
species that are capable of generating edible or industrially useful oils in
commercially significant yields, and include many of the plant species listed above.
Such oilseed crops are well known to persons skilled in the art.

RESULTS 

Isolation of the TAG1 (DGAT) cDNA from Arabidopsis thaliana

Since one of the most likely defects in the AS11 mutant is at the DGAT itself
(Table 1; Fig 4), the inventors attempted cloning strategies based on sequence
information of enzymes that share common substrates with DGAT. One of the
candidate enzymes that would serve this purpose is the acyl-CoA: cholesterol
acyltransferase (ACAT, EC 2.3.1.26) (Chang et al., 1997). Like DGAT, ACAT is an
ER protein functioning as an O-acyltransferase by using acyl-CoA as the fatty acyl
donor for the esterification of free cholesterol to generate sterol esters. Through a
BLAST database search, the inventors identified an Arabidopsis thaliana expressed
sequence tag (EST) [accession no. AA042298; SEQ ID NO:6] with a deduced
amino acid sequence showing 41% identity to that of the yeast acyl-CoA:
cholesterol acyltransferase (Yang et al., 1996; Yu et al., 1996), within the short
sequence (104 amino acids) that was available for the EST.
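
A percent identity such as the 41% quoted above is derived from a pairwise alignment. The following is a minimal sketch of that calculation, assuming identity is counted only over columns where neither sequence has a gap (search tools may use a different denominator). The helper name `percent_identity` and the two short aligned strings are hypothetical placeholders, not the EST or ACAT sequences.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned columns (gap columns excluded) with identical residues."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Keep only columns where neither sequence carries a gap character.
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b) if a != "-" and b != "-"]
    if not columns:
        return 0.0
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


# Toy alignment (hypothetical residues, for illustration only).
toy_a = "MKT-LLVAGW"
toy_b = "MRTALLV-GF"
print(percent_identity(toy_a, toy_b))
```

In the toy alignment, 8 columns are gap-free and 6 of them match.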

The corresponding cDNA (E6B2T7) clone was obtained from the
Arabidopsis Biological Resource Center, Columbus, Ohio. Upon complete
sequencing, the 878 bp E6B2T7 clone was found to be a partial cDNA. However,
the ORF prediction from this partial cDNA confirmed the initial EST search results in
that the encoded product is structurally similar to ACAT, especially in the regions at
the C-terminus. The inventors were confident that the cDNA contained the 3'
untranslated region through an ORF search, although the polyA tail was missing.

The inventors further used the partial cDNA sequence to search against
Arabidopsis thaliana genomic sequence information. An Arabidopsis 'IGF' BAC
clone 'F27F23' [accession no. AC003058] was identified to include a region that
matched the cDNA, and therefore it was concluded that this was the region
encompassing the corresponding gene. Moreover, this BAC clone 'F27F23' is
contained in the YAC clone, C1C06E08, which, according to the published map
position (http://weeds.mgh.harvard.edu/goodman/c2_b.html), represents a region
between centimorgan 35.9 and centimorgan 38.7 on chromosome II; this position is
similar to the estimated location for TAG1, and the lesion identified by the mutation
in AS11 (Katavic et al., 1995). In view of our previous results on the
characterization of the AS11 mutant, the map position of this gene strongly
suggested that it may encode a DGAT.

To clone a full-length cDNA, a series of oligonucleotide primers were
designed, based on the genomic sequences located at different positions 5'
upstream of the region covering the partial cDNA. We used these primers in
combination with a primer located at the 3' UTR of the partial cDNA (E6B2T7) to
perform PCR reactions with cDNA phagemid prepared from an Arabidopsis thaliana
(ecotype Columbia) silique-specific cDNA library (Giraudat et al., 1992) as a
template. The longest cDNA amplified was 1904 bp, which we subsequently
designated as TAG1, and deposited into the Genbank under accession AJ238008
[SEQ ID NO:1]. We believe this cDNA represents a full-length clone because its
size is in agreement with that of the transcript detected in the northern blot (see
Figure 6a). The longest open reading frame is flanked by a 134-nucleotide 5'
untranslated region and a 203-nucleotide 3' untranslated region. There is an
in-frame stop codon (TGA at position nt-43) which is followed by an in-frame ATG at
position nt-139. It is thus inferred that the ATG at position nt-139 is the initiation
codon.

The primary structure of TAG1 predicts a DGAT-related enzyme 

The predicted open reading frame of the TAG1 cDNA encodes a
polypeptide of 520 amino acids with a calculated molecular weight of 58993
Daltons. With the BLAST search program (Altschul et al., 1990), it was found that
the recently reported mouse diacylglycerol acyltransferase [accession no.
AF078752] (Cases et al., 1998) is a protein which showed the highest sequence
similarity to the deduced amino acid sequence of TAG1 (Figure 5a). TAG1 was
also similar to a human acyl CoA: cholesterol acyltransferase-related enzyme
[accession no. AF059202]. The human acyl CoA: cholesterol acyltransferase-
related enzyme, also known as ARGP1, is most likely to be a DGAT with no
significant ACAT activity, although the true nature of the enzyme awaits further
confirmation (Oelkers et al., 1998). The similarity between TAG1 and the
mammalian DGAT extends over a region of more than 400 amino acids with a
sequence identity of about 41%. A putative diacylglycerol/phorbol ester-binding
motif, HKW-X-X-RH-X-Y-X-P, a signature sequence observed to be unique to
DGAT while absent in the ACATs (Oelkers et al., 1998), is located at amino acids
414-424 ([SEQ ID NO:7]; Figure 5a). This diacylglycerol binding motif is also found
in the subsequently published B. napus DGAT sequences (Nikyiforuk et al., 1999;
GenBank/EMBL Accession Nos. AF155224, SEQ ID NO:8; AF164434, SEQ ID
NO:9). Among other cloned acyltransferases (e.g. GPATs, LPATs,
dihydroxyacetone phosphate acyltransferases) it has been reported that there is an
invariant proline in a highly hydrophobic block IV that may participate in acyl-CoA
binding (Lewin et al., 1999). In the TAG1 sequence, the hydrophobic block from
residues 221-229, containing an invariant proline at residue 224, might constitute
such a motif.
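
A dash-separated motif such as HKW-X-X-RH-X-Y-X-P can be located in a protein sequence with a simple pattern search. The sketch below is one way to do this, assuming X stands for any single residue; the helper names `motif_to_regex` and `find_motif` are hypothetical, and the peptide is a made-up string that merely embeds one instance of the motif (it is not the TAG1 sequence).

```python
import re

# Motif as quoted in the text; X is taken to mean "any one residue".
DAG_BINDING = "HKW-X-X-RH-X-Y-X-P"


def motif_to_regex(motif: str) -> re.Pattern:
    """Turn a dash-separated motif into a regex; X matches any one residue."""
    parts = motif.split("-")
    return re.compile("".join("." if p == "X" else re.escape(p) for p in parts))


def find_motif(sequence: str, motif: str) -> list[tuple[int, int]]:
    """Return 1-based (start, end) positions of each non-overlapping match."""
    pattern = motif_to_regex(motif)
    return [(m.start() + 1, m.end()) for m in pattern.finditer(sequence)]


# Made-up peptide for illustration only.
toy_protein = "MAAGLLHKWAVRHLYFPSTV"
print(find_motif(toy_protein, DAG_BINDING))
```

The motif spans 11 residues, which agrees with the 414-424 interval reported for TAG1.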

TAG1 showed some sequence similarity to other acyl CoA: cholesterol
acyltransferases from a number of species (Chang et al., 1997). However, the
similarity is largely confined to the C-terminus and is lower (around 30%) than is the
similarity of TAG1 to the mammalian DGAT.

The TAG1 protein has multiple hydrophobic domains (Figure 5b) and an
analysis by the PC Gene program predicted that the protein has 5 possible
transmembrane segments (amino acids 178-195, 233-253, 363-388, 433-476,
486-507). In the mammalian DGAT, a putative tyrosine phosphorylation motif was
observed (Cases et al., 1998), but no apparent tyrosine phosphorylation site could
be found in TAG1. However, a visual examination revealed a consensus sequence
(X-L200-X-K202-X-X-S205-X-X-X-V209; SEQ ID NO:10) identified as a targeting motif
typical of members of the SnRK1 protein kinase family, with serine residue 205
being the residue for phosphorylation. The SnRK1 (SNF1-related protein kinase-1)
proteins are a class of Ser/Thr protein kinases that have been increasingly
implicated in the global regulation of carbon metabolism in plants (Halford and
Hardie, 1998). This consensus SnRK1 targeting motif is also found in the
subsequently published B. napus DGAT sequences (Nikyiforuk et al., 1999;
GenBank/EMBL Accession Nos. AF155224; AF164434). Interestingly, similar
SnRK1 targeting motifs could also be identified in the lyso-phosphatidic acid
acyltransferases (LPATs) from coconut (Knutzon et al., 1995) and meadowfoam
(Lassner et al., 1995), respectively.

The TAG1 gene is ubiquitously expressed in Arabidopsis

Northern blot analyses were performed to investigate the expression profile
of the TAG1 gene. Total RNA was extracted from different tissues, including roots,
leaves, flowers, developing siliques, young seedlings and germinating seeds. The
highest steady-state level accumulation of TAG1 transcript was in RNA isolated
from germinating seeds and young seedlings (Figure 6a). TAG1 transcripts were
also detected in root, leaf and flower tissues, albeit at lower levels. Surprisingly, the
TAG1 gene is expressed in developing siliques at a level that is comparable to that
of other vegetative tissues, but lower than that of germinating seeds and young
seedlings. This expression profile in general is not inconsistent with the notion that
DGAT is present in all plant tissues capable of TAG biosynthesis (Kwanyuan and
Wilson, 1986). It has been shown in a number of plant species, including soybean
and safflower, that germinating seeds actively synthesize TAGs (Ichihara and Noda,
1981; Kwanyuan and Wilson, 1986; Wilson and Kwanyuan, 1986). The relatively
high level of expression in roots is also consistent with the fact that root plastids are
capable of synthesizing large amounts of triacylglycerol (Sparace et al., 1992).

Southern blot hybridization (Southern, 1975) was performed with genomic
DNA digested with several restriction enzymes including BglII, EcoRI and HindIII.
The TAG1 gene has no internal BglII or HindIII site, while one internal EcoRI site
exists. Our Southern analysis suggested that TAG1 most likely represents a single
copy gene in the Arabidopsis genome (Figure 6b).

An insertion mutation is found in the TAG1 gene in mutant AS11 

Alignment of the genomic sequence (accession no. AC003058; SEQ ID
NO:3) with that of the TAG1 cDNA [SEQ ID NO:1] revealed that the TAG1 gene
contains 16 exons and 15 introns, spanning a region of about 3.4 kb (Figure 7a).
DNA containing the TAG1 allele from AS11 was PCR-amplified and completely
sequenced. The AS11 TAG1 allele has a 147-bp insertion located at the central
region of intron 2. The insertion is a duplication of a segment that is composed of
12 bp from the 3' end of intron 1, the entire sequence of exon 2 (81 bp) and 54 bp
from the 5' end of intron 2 (Figure 7a).

In order to rule out the possibility of PCR artifacts, two sets of primers were
used to perform further PCR amplifications. Primers A and B (see Experimental
Procedures, Primer Strategy) located in exons 1 and 3, respectively, amplified a
DNA fragment that is about 150 bp longer from AS11 (Figure 7b, lane 2) than that
from the wild type (Figure 7b, lane 1). The second pair of primers, C and B
(Experimental Procedures), with one to be found in both exon 2 and the insertion
segment, and the other located in exon 3, generated two amplified fragments from
AS11 (Figure 7b, lane 4), while only one from the wild type (Figure 7b, lane 3).
Hence these results confirmed that the insertion mutation the inventors identified
through sequencing reflected the true nature of the mutation in the TAG1 gene in
the AS11 genome.

The AS11 TAG1 transcript has an 81-bp insertion in its open reading frame

Northern blot analyses indicated that there was no difference in the
expression profiles of the TAG1 gene between the AS11 mutant and wild type A.
thaliana. In order to investigate the effect of the mutation at the transcript level,
reverse-transcription PCR (RT-PCR) was performed to amplify the TAG1 transcript
from RNA extracted from germinating seedlings of mutant AS11. Sequencing
analysis revealed that there is an 81-bp insertion composed entirely of exon 2 in the
transcript from AS11. The exon 2 in the repeat is properly spliced. The alteration of
the transcript thus does not disturb the reading frame. However, this additional
exon 2 sequence in the AS11 transcript would result in an altered DGAT protein
with the 27 amino acid insertion 131 SHAGLFNLCVWLIAVNSRLIIENLMK 157 [SEQ ID
NO:11]. The inventors' data show that this insertion results in a 40-70% reduction
in DGAT activity throughout seed development (Katavic et al., 1995). The 81 bp
insert responsible for reduced DGAT activity in AS11 is visible in the comparison of
RT-PCR products (Figure 7b: compare lane 5 (WT) and lane 6 (AS11)). The DNA
aberration observed in the AS11 mutant was unexpected, since ethyl
methanesulfonate (EMS) generally causes point mutations. Although we cannot
rule out the possibility that this AS11 mutant was the result of a spontaneous
mutation event, EMS-induced deletions and insertions have been reported in other
systems (Mogami et al., 1986; Okagaki et al., 1991).
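
The sizes quoted for the AS11 insertion can be cross-checked with simple arithmetic: the three duplicated segments should sum to the 147-bp genomic insertion, and an 81-bp exon insertion preserves the reading frame only if 81 is a multiple of 3. The following sketch uses only the lengths stated in the text; the function names are hypothetical.

```python
# Segment lengths (bp) of the duplicated region, as stated in the text.
INTRON1_3PRIME = 12
EXON2 = 81
INTRON2_5PRIME = 54


def genomic_insert_length() -> int:
    """Total size of the duplicated genomic segment."""
    return INTRON1_3PRIME + EXON2 + INTRON2_5PRIME


def transcript_effect(extra_exon_bp: int) -> tuple[bool, int]:
    """Return (frame_preserved, extra_codons) for an exon-sized insertion."""
    return extra_exon_bp % 3 == 0, extra_exon_bp // 3


print(genomic_insert_length())   # 147
print(transcript_effect(EXON2))  # (True, 27)
```

The result agrees with the 147-bp genomic insertion and the in-frame 27-amino-acid insertion described above.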

The TAG1 gene insertion in Arabidopsis mutant AS11 affects seed
triacylglycerol accumulation, but not sterol ester accumulation in seeds.

Because TAG1 also showed some sequence homology to acyl CoA:
cholesterol acyltransferases (ACATs) from a number of species (Chang et al.,
1997), the inventors compared both triacylglycerol and sterol ester accumulation in
seeds of the wild-type A. thaliana and AS11 mutant. While the triacylglycerol
content and TAG/DAG ratios were reduced in AS11 (i.e. increased proportion of
seed oil DAGs), in contrast, the proportions of sterol esters in WT and AS11 seeds
were similar, at 0.8 and 1% of the total lipid extract, respectively (Table 1). If the
TAG1 lesion affected ACAT-like activity, one might expect a reduction in seed sterol
esters, but this was not observed. These results indicated that TAG1 is not involved
in sterol-ester homeogenesis, and thus not an acyl CoA: sterol acyltransferase.

TAG1 expression in yeast 

The TAG1 cDNA overexpressed in yeast resulted in a 3.5 to 4-fold increase
in microsomal DGAT activity compared to plasmid only (pYES2) control
transformants (Fig. 8), confirming that the TAG1 gene product functions as a DGAT.
When 14C 18:1-CoA was added to the yeast lysates, sterol esters were also labeled
in vitro (data not shown), but there was no significant difference in the 14C-labeled
sterol esters produced by lysates from the pYES2 GAL-induced control and the
pYES2:TAG1 GAL-induced transformant. This confirms that TAG1 does
not encode an acyl-CoA: sterol acyltransferase (like ACAT).

Complementation of the A. thaliana AS11 Mutant Line by Transformation
with the DGAT cDNA.

The cloned full-length DGAT cDNA was used as a template for PCR
amplification with the primers DGATXbaI (CTAGTCTAGAATGGCGATTTTGGA; SEQ
ID NO:12) and DGATXhoI (GCGCTCGAGTTTCATGACATCGA; SEQ ID NO:13) to
provide new restriction sites on each end of the sequence as described in
Experimental Procedures. A 1.6 kb fragment was excised by a XbaI/KpnI digestion
and ligated into the corresponding sites of the pSE129 vector (provided by Dr. P.
Covello, PBI/NRC). pSE129A is a vector derived from the plant transformation
vector pRD400 (Datla et al., 1992). The vector pSE129A contains the seed-specific
napin promoter and the nos terminator cloned into the EcoRI and HindIII sites of the
pRD400 plasmid (Figure 9). Hence in the DGAT-pSE129A construct, the
Arabidopsis DGAT cDNA is under the control of the napin promoter. The construct
integrity was confirmed by sequencing.
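
The primer names indicate which restriction site each primer adds. As a rough illustration, the sketch below scans the two primer sequences quoted above for the standard XbaI (TCTAGA) and XhoI (CTCGAG) recognition sequences. The dictionary layout and the helper `locate_sites` are hypothetical, and only the sites named in the primer labels are checked.

```python
# Standard recognition sequences for the enzymes named in the primer labels.
SITES = {"XbaI": "TCTAGA", "XhoI": "CTCGAG"}

# Primer sequences as quoted in the text.
PRIMERS = {
    "DGATXbaI": "CTAGTCTAGAATGGCGATTTTGGA",  # SEQ ID NO:12
    "DGATXhoI": "GCGCTCGAGTTTCATGACATCGA",   # SEQ ID NO:13
}


def locate_sites(primer: str) -> dict[str, int]:
    """Map enzyme name -> 1-based position of its first site in the primer."""
    hits = {}
    for enzyme, site in SITES.items():
        idx = primer.find(site)
        if idx != -1:
            hits[enzyme] = idx + 1
    return hits


for name, seq in PRIMERS.items():
    print(name, locate_sites(seq))
```

In the forward primer the XbaI site is immediately followed by ATG, consistent with a primer that places the start codon directly after the added site.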

The pSE129A containing the napin:DGAT cDNA was introduced into A.
tumefaciens, used to transform A. thaliana mutant AS11, and progeny analyzed as
described in Experimental Procedures. A number of T2 transgenic lines were
isolated which complemented the fatty acid mutant phenotype found in AS11
(reduced 20:1 and elevated polyunsaturated C18s), restoring the wild-type seed fatty
acid profile (Figure 10). This finding confirms the nature of the lesion in AS11 and
directly ties the AS11 lipid phenotype to this mutation.

Over-Expression of the DGAT cDNA in Wild-Type A. thaliana

The cloned full-length DGAT cDNA was used as a template for PCR
amplification with the primers DGATXbaI (CTAGTCTAGAATGGCGATTTTGGA; SEQ
ID NO:12) and DGATXhoI (GCGCTCGAGTTTCATGACATCGA; SEQ ID NO:13) to
provide new restriction sites on each end of the sequence as described in
Experimental Procedures. A 1.6 kb fragment was excised by a XbaI/KpnI digestion
and ligated into the corresponding sites of the pSE129 vector (provided by Dr. P.
Covello, PBI/NRC). pSE129A is a vector derived from the plant transformation
vector pRD400 (Datla et al., 1992). The vector pSE129A contains the seed-specific
napin promoter and the nos terminator cloned into the EcoRI and HindIII sites of the
pRD400 plasmid (Figure 9). Hence in the DGAT-pSE129A construct, the
Arabidopsis DGAT cDNA is under the control of the napin promoter. The construct
integrity was confirmed by sequencing.

The pSE129A containing the napin:DGAT cDNA was introduced into A.
tumefaciens, used to transform wild-type A. thaliana, and progeny analyzed as
described in Experimental Procedures. A number of T2 transgenic lines were
isolated which exhibited an increased oil content (Figure 11), an increased average
100-seed weight (Figure 12) and a strong linear correlation between the two traits
(Figure 13).

In terms of fatty acyl composition, wild type lines containing over-expressed
DGAT cDNA showed a decrease in the total saturates, and increases in the
monounsaturates and in the 18:1/[18:2 + 18:3] index, as shown in Table 2 below.
Such changes are all towards a "healthier" oil profile, and can be applied directly to
canola, other oilseeds in the Brassicaceae and other edible oil crops to produce
similar oil composition improvements.

Table 2

Fatty acid composition of seed oil from A. thaliana non-transformed wild-type
controls (WT Con) and three T2 transgenic lines (2', 9 and 11) of wild type
transformed with the DGAT cDNA under the control of a napin promoter
(napin:DGAT).

Line             Total Saturates (a)   Monounsaturates (b)   18:1/[18:2 + 18:3]
                 Wt%                   Wt%                   index (c)

WT Control       15.1 ± 0.1            36.7 ± 0.2            29.9 ± 0.6
2' napin:DGAT    13.4                  38.6                  34.1
9 napin:DGAT     13.1                  39.3                  35.6
11 napin:DGAT    13.1                  38.3                  33.0

(a) Includes 16:0, 18:0, 20:0, 24:0
(b) Includes 18:1, 20:1, 22:1, 24:1
(c) ([Wt % 18:1] ÷ [Wt % 18:2 + Wt % 18:3]) x 100
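
The 18:1/[18:2 + 18:3] index reported in Table 2 can be computed as sketched below, reading the column heading as a ratio of weight percentages multiplied by 100. The function name `oleic_index` and the input values are hypothetical; the individual 18:1, 18:2 and 18:3 weight percentages are not given in the table.

```python
def oleic_index(wt_18_1: float, wt_18_2: float, wt_18_3: float) -> float:
    """18:1/[18:2 + 18:3] index (x 100) from weight-percent values."""
    return wt_18_1 / (wt_18_2 + wt_18_3) * 100


# Hypothetical weight percentages for illustration only (not taken from Table 2).
print(round(oleic_index(15.0, 30.0, 20.0), 1))
```

A higher index means more oleic acid relative to the polyunsaturated C18 fatty acids.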



EXPERIMENTAL PROCEDURES 
Plant Material 

Arabidopsis thaliana ecotype Columbia and mutant AS11 were grown under
conditions described previously (Katavic et al., 1995). The A. thaliana mutant line
AS11 was generated and characterized relative to wild type (WT) A. thaliana
ecotype Columbia, as described by Katavic et al. (1995); (ATCC NO: PTA-1013).

DNA manipulation 

Standard methods and procedures were used for DNA preparation, plasmid
propagation and isolation (Sambrook et al., 1989). Sequencing was conducted on
an Applied Biosystems Model 373A DNA Sequencing System using the Taq
DyeDeoxy™ Terminator Cycle Sequencing Kit (Applied Biosystems, Inc.). The
nucleotide and the deduced amino acid sequences were compared with sequences
available in databanks using the BLAST program (Altschul et al., 1990).

Southern and Northern analysis

Total RNA was extracted from different tissues at various developmental
stages, using the method of Lindstrom and Vodkin (1991). RNA samples were
denatured with formaldehyde and separated on 1.2% formaldehyde-agarose gels.
About 5 µg of total RNA was loaded, and the amount of RNA per lane was
calibrated by the ethidium bromide-staining intensity of the rRNA bands. Genomic
DNA was isolated, digested with restriction enzymes and a Southern blot analysis
was performed according to Sambrook et al. (1989). The TAG1 DNA probe was
32P-labeled by random priming according to the protocols of the manufacturer (BRL).

PCR strategy

Primers used for the amplification of the TAG1 gene were as follows:
DGAT1 (agacacgaatcccattcccaccga; SEQ ID NO:14), DGAT2
(agtggtgacaacgcagggatgatg; SEQ ID NO:15), DGAT3 (atggtcgctcccacattgtgt;
SEQ ID NO:16), DGAT4 (catacaatccccatgacatttatca; SEQ ID NO:17). DGAT1
and DGAT2 amplify the 5' half of the TAG1 gene and DGAT3 and DGAT4 amplify
the 3' end of the TAG1 gene. Genomic DNA from AS11 was used as template for
PCR amplification of the mutant TAG1 allele using the thermal profile: 94°C 3 min;
40 cycles of 94°C 30 seconds, 62°C 45 seconds, 72°C 1 min; and 72°C 15 min.
To further confirm the mutation, primers A (cgaccgtcggttccagctcatcgg; [SEQ ID
NO:18]) and B (gcggccaatctcgcagcgatcttg; [SEQ ID NO:19]), as well as primers C
(taaacagtagactcatcatcg; [SEQ ID NO:20]) and B, were used in pairs, respectively,
to amplify the internal fragment containing the mutation. The primers DGAT1 and
DGAT4 were used for PCR amplification of the cDNA with an A. thaliana silique
cDNA library as template. Primers A and B were also used in RT-PCR
amplification of the cDNA fragment encompassing the insertion segment.
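
As an illustration of the thermal profile quoted above for the mutant TAG1 allele, the following sketch adds up the programmed hold times. The class and function names are hypothetical, and ramping time between temperatures (which depends on the thermal cycler) is deliberately ignored, so the result is a lower bound on the run time.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    temp_c: float  # block temperature in degrees Celsius
    seconds: int   # programmed hold time


def total_hold_seconds(initial: Step, cycle: list, n_cycles: int, final: Step) -> int:
    """Sum the programmed hold times of a PCR profile (ramp times excluded)."""
    per_cycle = sum(step.seconds for step in cycle)
    return initial.seconds + n_cycles * per_cycle + final.seconds


# Profile as stated in the text: 94 C 3 min; 40 x (94 C 30 s, 62 C 45 s, 72 C 1 min); 72 C 15 min.
initial = Step(94, 180)
cycle = [Step(94, 30), Step(62, 45), Step(72, 60)]
final = Step(72, 900)

total_seconds = total_hold_seconds(initial, cycle, 40, final)
print(f"Programmed hold time: {total_seconds / 60:.0f} min")
```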

Construction of TAG1 Multicopy Vector and Transformation and
Characterization of DGAT Expression in Yeast

The TAG1 cDNA was cloned into pBluescript SK as described (Hadjeb and
Berkowitz, 1996). The cDNA was cut out from the vector with KpnI/XbaI, and
subsequently cloned into the respective sites of the yeast expression vector pYES2
(Invitrogen). The construct was confirmed by sequencing. Constructs with TAG1
transcription under the control of the GAL1 promoter released a fragment of
approximately 1.9 kb. Because the TAG1 fragment has its own initiating ATG
codon, the product expressed is not a fusion protein. As a host for yeast expression,
an SLC deletion strain (YMN5 [slc1Δ2::LEU2 ura3]) (kindly provided by M.M.
Nagiec and R.C. Dickson, University of Kentucky, Lexington, KY; Nagiec et al.,
1993) was used; we reasoned that in this mutant, the endogenous DAG pool may
be lower than in WT yeast, and that this would allow us to maximize the activity
from over-expressed TAG1 in the presence of exogenously supplied 14C-DAG
during in vitro DGAT assays of transformant lysates. Yeast transformation was
performed according to Elble (1992). YMN5 transformants containing vector only
(pYES2) were used as controls. Single colonies were cultured overnight in 20 mL of
SD medium (Synthetic Dextrose medium with glucose and without uracil, as
described by Ausubel et al., 1995, Vol. 2, p. 13.1.3) on a rotary shaker (270 rpm) at
28°C. Cells were pelleted from the overnight culture and resuspended in 50 mL of
medium for induction of expression (SD medium containing galactose and without
uracil). Cells were reincubated at 28°C, with shaking at 270 rpm, and harvested
after 4-6 hr. GAL-induced yeast transformants were harvested by centrifugation at
5000 rpm for 5 min and resuspended in 100 mM Hepes-NaOH, pH 7.4, containing 1
mM EDTA and 1 mM DTT. Cell lysates were prepared using acid-washed glass
beads as described by Ausubel et al. (1995). Protein in yeast lysates was measured
using the Bradford (1976) assay; protein levels in each lysate were normalized and
aliquots (250 µg protein) were assayed for DGAT activity as described below.
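
Normalizing lysates to a fixed protein amount comes down to a simple volume calculation once the Bradford concentration is known. The sketch below is illustrative only: the function name and the two example concentrations are hypothetical, and the 250 µg target is taken from the text.

```python
def aliquot_volume_ul(protein_mg_per_ml: float, target_ug: float = 250.0) -> float:
    """Volume of lysate (in µL) that contains target_ug of protein.

    1 mg/mL is the same as 1 µg/µL, so the volume is simply mass / concentration.
    """
    if protein_mg_per_ml <= 0:
        raise ValueError("protein concentration must be positive")
    return target_ug / protein_mg_per_ml


# Hypothetical Bradford results (mg/mL) for a control and a TAG1 transformant.
lysates = {"pYES2 (vector only)": 6.25, "pYES2-TAG1": 5.0}
volumes = {name: aliquot_volume_ul(conc) for name, conc in lysates.items()}
for name, vol in volumes.items():
    print(f"{name}: {vol:.1f} µL for 250 µg protein")
```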

Lipid Substrates and DGAT Analyses 

14C-labeled diolein [1-14C oleic] (sp. activity 55 mCi/mmol) was purchased
from American Radiolabeled Chemicals (St. Louis, MO). The 14C-labeled
sn-1,2-diolein isomer was purified by TLC on borate-impregnated plates and emulsified in
Hepes buffer in the presence of 0.2% Tween-20 as described by Taylor et al. (1991).
20:1-CoA, CoASH, ATP, and all other biochemicals were purchased from Sigma.

DGAT assays were conducted at pH 7.4, with shaking at 100 rpm in a water
bath at 30°C for 30-60 min. Assay mixtures (0.5 mL final volume) contained lysate
protein (250 µg), 90 mM Hepes-NaOH, 0.5 mM ATP, 0.5 mM CoASH, 1 mM MgCl2,
200 µM sn-1,2-diolein (sp. activity 2 nCi/nmol) in 0.02% Tween-20, and 18 µM
20:1-CoA as the acyl donor. The 14C-labeled TAGs were isolated by TLC and quantified
as described by Taylor et al. (1991).
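
The substrate amounts per assay follow from the stated concentrations and the 0.5 mL final volume. The sketch below assumes micromolar units for diolein (200 µM) and for the acyl-CoA donor (18 µM) and a specific activity of 2 nCi/nmol; the unit symbols are damaged in the scanned text, so these are labeled assumptions, and the function name is hypothetical.

```python
def nmol_in_volume(conc_um: float, volume_ml: float) -> float:
    """Amount in nmol; 1 µM equals 1 nmol/mL, so nmol = µM x mL."""
    return conc_um * volume_ml


ASSAY_VOLUME_ML = 0.5            # final assay volume stated in the text
DIOLEIN_UM = 200.0               # assumed unit: µM
ACYL_COA_UM = 18.0               # assumed unit: µM
SPECIFIC_ACTIVITY_NCI_PER_NMOL = 2.0  # assumed unit: nCi/nmol

diolein_nmol = nmol_in_volume(DIOLEIN_UM, ASSAY_VOLUME_ML)
acyl_coa_nmol = nmol_in_volume(ACYL_COA_UM, ASSAY_VOLUME_ML)
label_nci = diolein_nmol * SPECIFIC_ACTIVITY_NCI_PER_NMOL

print(f"diolein: {diolein_nmol:.0f} nmol ({label_nci:.0f} nCi), acyl-CoA: {acyl_coa_nmol:.0f} nmol")
```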

Further Lipid and Sterol Ester Analyses in AS11 and WT:

Total lipid extracts (TLEs), and lipid class analyses in WT and the AS11
mutant were performed as described by Taylor et al. (1991; 1992) and by Katavic
et al. (1995). Relative seed oil content was also measured by magic angle sample
spinning 1H-NMR, according to the method of Rutar (1989). Analyses were
conducted with 200-seed samples of intact wild-type and AS11 seeds using a
Bruker AM wide-bore spectrometer (Bruker Analytische Masstechnik GmbH,
Silberstreifen D-76287, Rheinstetten 4/Karlsruhe, Germany) operating at 360 MHz.
To reduce anisotropic line broadening, the seed sample was rotated at 1 kHz in a
zirconium rotor oriented 54.7° to the magnetic field. The integration responses for
resonances attributable to liquid-like oil were summed and the value for AS11 seed
was recorded relative to the response for the WT control seed sample, the latter set
at a value of 1.00.
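
The 54.7° rotor orientation is the "magic angle", the angle at which 3cos²θ − 1 = 0. The sketch below computes it and shows the relative-oil-content normalization described above; the function name and the integral values are hypothetical and serve only to illustrate the arithmetic.

```python
import math

# The magic angle satisfies 3*cos(theta)**2 - 1 = 0, i.e. theta = arccos(1/sqrt(3)).
magic_angle_deg = math.degrees(math.acos(1.0 / math.sqrt(3.0)))


def relative_oil_content(sample_integral: float, wt_integral: float) -> float:
    """Summed oil-resonance integral of a sample relative to the WT control (WT = 1.00)."""
    if wt_integral <= 0:
        raise ValueError("WT integral must be positive")
    return sample_integral / wt_integral


print(f"magic angle = {magic_angle_deg:.1f} degrees")
# Hypothetical summed integrals in arbitrary units.
print(f"relative oil content = {relative_oil_content(75.0, 100.0):.2f}")
```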

Sterol esters were purified from the TLEs by thin layer chromatography
(TLC) on Silica H plates developed in hexane:diethyl ether:formic acid (80:20:2,
v/v/v). After elution from the Silica H with chloroform:methanol (2:1, v/v), the sterol
esters were quantified by saponification followed by methylation of the resulting fatty
acids with 3N methanolic-HCl. The fatty acid methyl esters (FAMEs) were analyzed
by GC as described previously (Taylor et al., 1991). The free sterols released by
saponification were also analyzed by GC on a 30 m DB-5 column; GC temperature
program: initial temp 180°C, increasing at 10°C/min to 300°C and held at this
temperature for 15 min. The sterol ester content was reported as a % of the TLE;
i.e. FAMEs released from sterol esters calculated as a proportion of the FAMEs
released by transmethylation of the total lipid extract (TLE).
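
Two small calculations are implied by the paragraph above: the length of the GC temperature program and the sterol ester content expressed as a percentage of the TLE. The sketch below is a hedged illustration; the function names and the FAME amounts are hypothetical, while the temperature program values are those stated in the text.

```python
def gc_program_minutes(start_c: float, end_c: float, ramp_c_per_min: float, hold_min: float) -> float:
    """Duration of a single-ramp GC program: ramp time plus final hold."""
    if ramp_c_per_min <= 0:
        raise ValueError("ramp rate must be positive")
    return (end_c - start_c) / ramp_c_per_min + hold_min


def sterol_ester_percent(fame_from_sterol_esters: float, fame_from_tle: float) -> float:
    """FAMEs released from sterol esters as a percentage of FAMEs from the whole TLE."""
    if fame_from_tle <= 0:
        raise ValueError("TLE FAME amount must be positive")
    return fame_from_sterol_esters / fame_from_tle * 100.0


run_minutes = gc_program_minutes(180.0, 300.0, 10.0, 15.0)
print(f"GC program length: {run_minutes:.0f} min")
# Hypothetical FAME amounts in the same mass unit.
print(f"sterol esters: {sterol_ester_percent(0.5, 50.0):.1f} % of TLE")
```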

Construction of Plant Transformation Vector Containing the Wild-Type DGAT
Gene for Over-Expression in WT A. thaliana and Complementation of the A.
thaliana AS11 Mutant:

Two primers:

Gen 1 (GAGAGGATCCACGCTCACGACCCATTCTTCCCG; [SEQ ID NO:21]), and

Gen 2 (AAGAAGGATCCATCCCCAAAACGGGACCACCAA; [SEQ ID NO:22])

were synthesized according to sequences upstream and downstream of the
TAG1 gene. These primers were used to PCR amplify a genomic fragment of 5.1
kb from wild-type A. thaliana. The PCR fragment was purified and digested with
BamHI and inserted into the corresponding site in plasmid pRD400 (Datla et al.,
1992) to generate the plant transformation vector DGATg-pRD400.

Construction of DGAT cDNA Plant Transformation Vector for Seed-Specific
Expression:

The cloned full-length DGAT cDNA was used as a template for PCR
amplification with the primers DGATXbaI (CTAGTCTAGAATGGCGATTTTGGA; SEQ
ID NO:12) and DGATXhoI (GCGCTCGAGTTTCATGACATCGA; SEQ ID NO:13) to
provide new restriction sites on each end of the sequence. The PCR profile was as
follows: 94°C 1 min; 30 cycles of 94°C 30 seconds, 55°C 30 seconds, 72°C 1 min;
and 72°C 5 min. The PCR product was then ligated into the pCR2.1 vector
(Invitrogen). A 1.6 kb fragment was excised by an XbaI/KpnI digestion and ligated
into the corresponding sites of the pSE129 vector (provided by Dr. P. Covello,
PBI/NRC). pSE129A is a vector derived from the plant transformation vector
pRD400 (Datla et al., 1992). The vector pSE129A contains the seed-specific napin
promoter and the nos terminator cloned into the EcoRI and HindIII sites of the
pRD400 plasmid (see Figure 9). Hence in the DGAT-pSE129A construct the
Arabidopsis DGAT cDNA is under the control of the napin promoter. The construct
integrity was confirmed by sequencing.
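
The restriction sites that the two cDNA primers add can be checked directly from their sequences. The sketch below is an illustrative check, not part of the described procedure: the primer sequences and the 30-nt cDNA fragment (positions 1681-1710 of SEQ ID NO:1) are copied from this document, the XbaI (TCTAGA) and XhoI (CTCGAG) recognition sequences are the standard ones, and all variable and function names are hypothetical.

```python
XBAI_SITE = "TCTAGA"
XHOI_SITE = "CTCGAG"

DGAT_XBAI = "CTAGTCTAGAATGGCGATTTTGGA"   # SEQ ID NO:12
DGAT_XHOI = "GCGCTCGAGTTTCATGACATCGA"    # SEQ ID NO:13
CDNA_3PRIME_FRAGMENT = "cgaaaaggatcgatgtcatgaaacaactgt"  # SEQ ID NO:1, nt 1681-1710


def reverse_complement(seq: str) -> str:
    """Reverse complement of an unambiguous DNA sequence."""
    complement = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(complement[base] for base in reversed(seq.upper()))


# Locate each recognition site inside its primer (0-based index, -1 if absent).
xba_pos = DGAT_XBAI.find(XBAI_SITE)
xho_pos = DGAT_XHOI.find(XHOI_SITE)

# The forward primer carries the ATG start codon right after the XbaI site.
start_codon_follows_site = DGAT_XBAI[xba_pos + 6 : xba_pos + 9] == "ATG"

# The reverse primer's gene-specific part should match the cDNA as a reverse complement.
gene_specific = DGAT_XHOI[xho_pos + len(XHOI_SITE):]
reverse_primer_anneals = reverse_complement(gene_specific) in CDNA_3PRIME_FRAGMENT.upper()

print(xba_pos, xho_pos, start_codon_follows_site, reverse_primer_anneals)
```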

Transformation of Agrobacterium with Plant DGAT Vector Constructs: 

Electrocompetent Agrobacterium cells, GV3101 (pMP90) strain, were
prepared as follows: An Agrobacterium culture was grown 24 to 48 hrs in 2YT, and
when the absorbance at 600 nm reached 0.5 to 0.7, the cells were chilled on ice
and pelleted by centrifugation (5,000 x g, 10 min in a GSA rotor at 4°C). The pellet
was washed in 1, 0.5 and 0.02 volumes of cold 10% sterile glycerol and
resuspended in 0.01 volume of cold 10% glycerol. The electrocompetent cells were
then frozen in liquid N2 and stored at -70°C. The Agrobacterium cells were
transformed by electroporation with 20-50 ng of transforming DNA (either
DGATg-pRD400 or DGAT-pSE129A) according to the manufacturer's instructions, plated on
a selective medium (LB with 50 µg/mL kanamycin) and incubated overnight at 28°C.
Single transformed cells were grown overnight (28°C, 225 r.p.m.) in 5 mL LB with
50 µg/mL kanamycin and 25 µg/mL gentamycin. DNA extraction and purification
were performed with a Qiaprep Spin Miniprep kit (Qiagen). The fidelity of the
construct was re-checked by DNA sequencing before plant transformation.

Transformation of Arabidopsis thaliana:

The transformation protocol was adapted from that described by Clough and
Bent (1998). Seeds of Arabidopsis thaliana ecotype Columbia and mutant AS11
(Katavic et al., 1995) were grown at 22°C under fluorescent illumination
(120 µE·m^-2·s^-1) in a 16 h light/8 h dark regime. Typically, four to six plants were raised in a
10 cm2 pot in moistened Terra-lite Redi-earth (W. R. Grace & Co. Canada Ltd., Ajax,
ON, Canada). To prevent the soil mix in the pot from falling into the inoculation
media, soil was mounded as a platform with seeds sown on top, and the whole pot
covered by a nylon window screen and secured by a rubber band. Plants were
vacuum infiltrated in an Agrobacterium suspension when the first flowers started
opening.

To grow Agrobacterium, a 5 mL suspension in LB medium containing 50
µg/mL kanamycin and 25 µg/mL gentamycin was cultured overnight at 28°C. The
day before infiltration, this "seed culture" was divided into four flasks containing 250
mL of LB medium supplemented with 50 µg/mL kanamycin and 25 µg/mL
gentamycin. These cultures were grown overnight at 28°C. The next morning, after
the absorbance at 600 nm was checked (approximately 1.0), the cells were
harvested by centrifugation (5,000 x g, 10 min in a GSA rotor at room temperature)
and resuspended in the infiltration medium (sucrose 5%; Silwet-77 0.005% in water)
to obtain an optical density at 600 nm of 0.8. The Agrobacterium suspension was
then poured into a beaker and the potted plants inverted into the beaker so that the
flowers and bolts were submerged. The beaker was then placed into a large bell jar
and a vacuum drawn using a vacuum pump, until bubbles formed on the stem
surfaces and the solution started to bubble slightly, and then the vacuum was
released rapidly. [Note: The necessary time and pressure will vary from one lab
setup to the next, but good infiltration is visibly apparent as uniformly darkened,
water-soaked tissue.] Pots were removed from the beaker, laid on their side in a
plastic tray and covered with a plastic dome, to maintain humidity. The following
day, the plants were uncovered, set upright and allowed to grow for approximately
four weeks in a growth chamber under continuous light conditions as described by
Katavic et al. (1995). When the siliques were mature and dry, seeds were
harvested and selected for positive transformants.
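
Resuspending the pelleted cells to a target optical density is a C1·V1 = C2·V2 calculation, under the assumption (an approximation, valid only in the linear range of the spectrophotometer) that OD600 scales with cell density. The function name is hypothetical; the culture volume corresponds to the four 250 mL flasks described above.

```python
def resuspension_volume_ml(od_measured: float, culture_ml: float, od_target: float = 0.8) -> float:
    """Volume of infiltration medium giving od_target, assuming OD600 is proportional to cell density."""
    if od_measured <= 0 or culture_ml <= 0 or od_target <= 0:
        raise ValueError("all inputs must be positive")
    return od_measured * culture_ml / od_target


# Four 250 mL overnight cultures at an absorbance of about 1.0, target OD600 of 0.8.
infiltration_ml = resuspension_volume_ml(od_measured=1.0, culture_ml=4 * 250.0)
print(f"Resuspend the pellet in about {infiltration_ml:.0f} mL of infiltration medium")
```

In practice the final OD would still be checked on the resuspended cells and adjusted.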

Selection of Putative Transformants (Transgenic Plants) and Analysis of
Transgenic Plants:

For each construct, seeds were harvested in bulk. Seeds were
surface-sterilized by submerging them in a solution containing 20% bleach and 0.01% Triton
X-100 for 20 min, followed by three rinses with sterile water. Sterilized seeds were
then plated by resuspending them in sterile 0.1% phytagar at room temperature
(about 1 mL phytagar for every 500-1000 seeds), and then applying a volume
containing 2,000-4,000 seeds onto a 150 x 15 mm kanamycin selection plate. Plates
were incubated for 2 days in the cold without light, and then grown for 7-10 days in
a controlled environment (22°C under fluorescent illumination (120 µE·m^-2·s^-1) in a
16 h light/8 h dark regime). The selection media contained ½ MSG medium, 0.8%
phytagar, 3% sucrose, 50 µg/mL kanamycin and 50 µg/mL Timentin. Petri dishes
and lids were sealed with Micropore™ surgical tape (3M Canada, London,
ON, Canada). After 7-10 days, drug-resistant plants that had green leaves and well
established roots within the medium were identified as transformants and, at the 3-5
leaf stage, selected transformants were transplanted into flats filled with heavily
moistened soil mix. Transformants were grown to maturity and mature seeds (T2
generation as defined in Katavic et al. (1994)) were harvested from
individual plants, and further analyzed.
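
The plating step gives a seed density (about 1 mL of 0.1% phytagar per 500-1000 seeds) and a seed count per plate (2,000-4,000), which together imply a range of plating volumes. The sketch below works that range out; the function name is hypothetical.

```python
def phytagar_volume_range_ml(n_seeds: int) -> tuple:
    """(min, max) mL of 0.1% phytagar for n_seeds at 1 mL per 1000 down to 1 mL per 500 seeds."""
    if n_seeds <= 0:
        raise ValueError("seed count must be positive")
    return n_seeds / 1000.0, n_seeds / 500.0


for seeds in (2000, 4000):
    low, high = phytagar_volume_range_ml(seeds)
    print(f"{seeds} seeds per plate: {low:.0f}-{high:.0f} mL phytagar")
```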

DNA isolation from and analysis of Transformants 

Genomic DNA was isolated from individual plants following the protocol of
Dellaporta et al. (1983). A PCR amplification using the paired primers described
previously for the DGAT cDNA or for the DGAT gene was performed to confirm the
presence of the cDNA or the gene, respectively, in the T1 transformants. Southern
analyses (Southern, 1975) were performed to select the transformants containing a
single copy of the inserted fragment. DNA samples were digested with restriction
enzymes (Bgl II for the DGAT cDNA and Eco RI for the DGAT gene), resolved by
electrophoresis on a 1% agarose gel, and Southern blotting performed using a
nylon filter (Hybond-N+, Amersham) according to Sambrook et al. (1989). The
DGAT cDNA fragment, labelled with α-[32P]dCTP (NEN/DuPont) using the Random
Primer DNA labelling kit (Gibco BRL), was used as a probe. Hybridization was
performed at 60°C according to Church and Gilbert (1984). The filter was then
exposed to Kodak X-OMAT-AR film.

DEPOSIT INFORMATION 

The following biological material has been deposited at the American Type
Culture Collection (ATCC) of 10801 University Boulevard, Manassas, Virginia,
20110-2209, U.S.A. All of these deposits were made on behalf of the
Applicant/Assignee (National Research Council of Canada) under the terms of the
Budapest Treaty on the dates indicated, and have been given the accession
numbers shown below.

Deposited Material           Date of Deposit      Accession No.

Arabidopsis DGAT gene        November 29, 1999    PTA-988
Arabidopsis DGAT cDNA        November 29, 1999    PTA-989
Arabidopsis AS11 seeds       December 3, 1999     PTA-1013

The deposit receipts are shown later in this description. 

SEQUENCE LISTING FREE TEXT

The Sequence Listing provided below contains free text entries in respect of
SEQ ID NOs: 12 to 22. The free text used in the Sequence Listing is repeated as
follows:

SEQ ID NO:12    Primer of DGATXbaI
SEQ ID NO:13    Primer of DGATXhoI
SEQ ID NO:14    Primer DGAT1
SEQ ID NO:15    Primer DGAT2
SEQ ID NO:16    Primer DGAT3
SEQ ID NO:17    Primer DGAT4
SEQ ID NO:18    Primer A
SEQ ID NO:19    Primer B
SEQ ID NO:20    Primer C
SEQ ID NO:21    Primer Gen 1
SEQ ID NO:22    Primer Gen 2.

A summary of all of the listed sequences is provided below for ease of
review:

SEQ ID NO:1 - (pDGAT; vector containing isolated and purified deoxyribonucleic
acid cDNA; ATCC No. PTA-989), GenBank/EMBL Accession No. AJ238008.

SEQ ID NO:2 - The deduced amino acid sequence of the Arabidopsis DGAT
(AtTAG1) protein.

SEQ ID NO:3 - (pgenomic DGAT; vector containing isolated and purified genomic
deoxyribonucleic acid (genomic DNA); ATCC No. PTA-988).

SEQ ID NO:4 - MDGAT, mouse DGAT [GenBank/EMBL Accession No. AF078752
(Cases et al., 1998)].

SEQ ID NO:5 - HARGP1, human ARGP1 protein [GenBank/EMBL Accession No.
AF059202; Oelkers et al., 1998].

SEQ ID NO:6 - Arabidopsis thaliana expressed sequence tag (EST) [accession no.
AA042298].

SEQ ID NO:7 - A diacylglycerol/phorbol ester-binding motif found in SEQ ID NO:2,
SEQ ID NO:8 and SEQ ID NO:9 (414 HKWMVRHIYFP 424).

SEQ ID NO:8 - B. napus DGAT amino acid sequence, GenBank/EMBL Accession
No. AF155224.

SEQ ID NO:9 - B. napus DGAT amino acid sequence, GenBank/EMBL Accession
No. AF164434.

SEQ ID NO:10 - Targeting motif typical of members of the SnRK1 protein kinase
family found in SEQ ID NO:2, SEQ ID NO:8 and SEQ ID NO:9:
X-L(200)-X-K(202)-X-X-S(205)-X-X-X-V(209)

SEQ ID NO:11 - A 27 amino acid insertion repeat in SEQ ID NO:2 found in the
Arabidopsis thaliana AS11 mutant:
131 SHAGLFNLCVVVLIAVNSRLIIENLMK 157

SEQ ID NO:12 - CTAGTCTAGAATGGCGATTTTGGA (nucleotide sequence of
Primer DGATXbaI).

SEQ ID NO:13 - GCGCTCGAGTTTCATGACATCGA (nucleotide sequence of
Primer DGATXhoI).

SEQ ID NO:14 - AGACACGAATCCCATTCCCACCGA (nucleotide sequence of
Primer DGAT1).

SEQ ID NO:15 - AGTGGTGACAACGCAGGGATGATG (nucleotide sequence of
Primer DGAT2).

SEQ ID NO:16 - ATGGTCGCTCCCACATTGTGT (nucleotide sequence of Primer
DGAT3).

SEQ ID NO:17 - CATACAATCCCCATGACATTTATCA (nucleotide sequence of
Primer DGAT4).

SEQ ID NO:18 - CGACCGTCGGTTCCAGCTCATCGG (nucleotide sequence of
Primer A).

SEQ ID NO:19 - GCGGCCAATCTCGCAGCGATCTTG (nucleotide sequence of
Primer B).

SEQ ID NO:20 - TAAACAGTAGACTCATCATCG (nucleotide sequence of
Primer C).

SEQ ID NO:21 - GAGAGGATCCACGCTCACGACCCATTCTTCCCG (nucleotide
sequence of primer Gen 1).

SEQ ID NO:22 - AAGAAGGATCCATCCCCAAAACGGGACCACCAA
(nucleotide sequence of primer Gen 2).

SEQ ID NO:23 - AS11 mutant DGAT cDNA nucleotide sequence.

SEQ ID NO:24 - AS11 mutant DGAT genomic DNA nucleotide sequence.

SEQ ID NO:25 - the deduced amino acid sequence of SEQ ID NO:23.

ATCC

10801 University Blvd • Manassas, VA 20110-2209 • Telephone: 703-365-2700 • FAX: 703-

BUDAPEST TREATY ON THE INTERNATIONAL RECOGNITION OF
THE DEPOSIT OF MICROORGANISMS FOR THE PURPOSES OF PATENT PROCEDURE

INTERNATIONAL FORM

RECEIPT IN THE CASE OF AN ORIGINAL DEPOSIT ISSUED PURSUANT TO RULE 7.3
AND VIABILITY STATEMENT ISSUED PURSUANT TO RULE 10.2

[Stamp: RECEIVED KEGB, DEC 10 1999]

To: (Name and Address of Depositor or Attorney)

NRC Canada
Attn: David C Taylor
Plant Biotechnology Institute
110 Gymnasium Place
Saskatoon, SK S7N 0W9 Canada

Deposited on Behalf of: National Research Council of Canada

Identification Reference by Depositor:      Patent Deposit Designation

Arabidopsis DGAT gene                        PTA-988
Arabidopsis DGAT cDNA                        PTA-989

The deposits were accompanied by: a scientific description X a proposed taxonomic description indicated above. The
deposits were received November 29, 1999 by this International Depository Authority and have been accepted.

AT YOUR REQUEST: X We will not inform you of requests for the strains.

The strains will be made available if a patent office signatory to the Budapest Treaty certifies one's right to receive, or if a U.S.
Patent is issued citing the strains, and ATCC is instructed by the United States Patent & Trademark Office or the depositor to
release said strains.

If the cultures should die or be destroyed during the effective term of the deposit, it shall be your responsibility to replace them
with living cultures of the same.

The strains will be maintained for a period of at least 30 years from date of deposit, or five years after the most recent request for
a sample, whichever is longer. The United States and many other countries are signatory to the Budapest Treaty.

The viability of the cultures cited above was tested December 2, 1999. On that date, the cultures were viable.

International Depository Authority: American Type Culture Collection, Manassas, VA 20110-2209 USA.

Signature of person having authority to represent ATCC:

[signature]                                  Date: December 3, 1999

M. Hafley, Administrator, Patent Depository
cc: Kirby, Eades, Gale and Baker

BUDAPEST TREATY ON THE INTERNATIONAL RECOGNITION OF
THE DEPOSIT OF MICROORGANISMS FOR THE PURPOSES OF PATENT PROCEDURE

INTERNATIONAL FORM

RECEIPT IN THE CASE OF AN ORIGINAL DEPOSIT ISSUED PURSUANT TO RULE 7.3
AND VIABILITY STATEMENT ISSUED PURSUANT TO RULE 10.2

To: (Name and Address of Depositor or Attorney)

National Research Council of Canada
Attn: David C. Taylor
Plant Biotechnology Institute
110 Gymnasium Place
Saskatoon, SK S7N 0W9
Canada

Deposited on Behalf of: National Research Council of Canada

Identification Reference by Depositor:      Patent Deposit Designation

Arabidopsis AS11                             PTA-1013

The seeds were accompanied by: _ a scientific description X a proposed taxonomic description indicated above.
The seeds were received December 3, 1999 by this International Depository Authority and have been accepted.

AT YOUR REQUEST: X We will not inform you of requests for the seeds.

The seeds will be made available if a patent office signatory to the Budapest Treaty certifies one's right to receive,
or if a U.S. Patent is issued citing the seeds and ATCC is instructed by the United States Patent & Trademark Office
or the depositor to release said seeds.

If the seeds should die or be destroyed during the effective term of the deposit, it shall be your responsibility to
replace them with viable seeds of the same.

The seeds will be maintained for a period of at least 30 years from date of deposit, or five years after the most recent
request for a sample, whichever is longer. The United States and many other countries are signatory to the Budapest
Treaty.

The viability of the seeds cited above was tested December 23, 1999. On that date, the seeds were viable.

International Depository Authority: American Type Culture Collection, Manassas, VA 20110-2209 USA.

Signature of person having authority to represent ATCC:

[signature]                                  Date: January ?, 2000

Barbara M. Bailey, Administrator, Patent Depository
cc: Kirby, Eades et al.

CLAIMS:

1. Isolated and purified deoxyribonucleic acid (DNA), characterized in that
said DNA includes a sequence according to SEQ ID NO:1 or a part of SEQ ID NO:1, or
a sequence that is substantially homologous to SEQ ID NO:1.

2. Isolated and purified deoxyribonucleic acid (DNA), characterized in that
said DNA includes a sequence according to SEQ ID NO:3 or a part of SEQ ID NO:3, or
a sequence that is substantially homologous to SEQ ID NO:3.

3. A vector that contains a nucleic acid sequence according to SEQ ID
NO:1, or a part of SEQ ID NO:1, or a sequence that is substantially homologous to
SEQ ID NO:1.

4. A vector that contains a nucleic acid sequence according to SEQ ID
NO:3, or a part of SEQ ID NO:3, or a sequence that is substantially homologous to
SEQ ID NO:3.

5. A vector that contains a deoxyribonucleic acid sequence according to
SEQ ID NO:23.

6. A vector according to claim 3 or claim 5, characterized in that said
sequence is present in said vector in a sense orientation.

7. A vector according to claim 3, characterized in that said sequence is
present in said vector in an anti-sense orientation.

8. Plasmid pDGATcDNA (ATCC PTA-989).

9. Plasmid pDGATgene (ATCC PTA-988).

10. A plant having a genome, characterized in that the genome contains an
introduced recombinant nucleotide sequence of SEQ ID NO:1, or a part of SEQ ID
NO:1, or a sequence that is substantially homologous to SEQ ID NO:1.

11. A plant seed having a genome, characterized in that said genome
contains an introduced recombinant nucleotide sequence of SEQ ID NO:1, or a part of
SEQ ID NO:1, or a sequence that is substantially homologous to SEQ ID NO:1.

12. A genetically transformed plant, characterized in that said plant has a
genome that has been transformed by a vector according to claim 3 or claim 4 or
claim 5.

13. A genetically transformed plant seed, characterized in that said seed
has been transformed by a vector according to claim 3 or claim 4 or claim 5.

14. A plant seed as claimed in claim 11 or claim 13, characterized by
exhibiting an altered seed oil content compared to the seed oil content of seeds of
genomically-unmodified plants of the same genotype grown in identical conditions at
the same time.

15. A plant seed as claimed in claim 11 or claim 13, characterized by
exhibiting an altered diacylglycerol content in its seed oil compared to the
diacylglycerol content of seeds of genomically-unmodified plants of the same genotype
grown in identical conditions at the same time.

16. A plant seed as claimed in 11 or 13, characterized by exhibiting a seed
oil with an altered fatty acyl composition compared to the fatty acid composition of
plants of the same genotype grown in identical conditions at the same time.

17. A plant as claimed in 10 or 12, characterized by exhibiting an enhanced
biomass compared to the biomass of genomically-unmodified plants of the same
genotype grown in identical conditions at the same time.

18. A seed as claimed in 11 or 13, characterized by exhibiting an enhanced
biomass compared to the biomass of genomically-unmodified plants of the same
genotype grown under identical conditions at the same time.

19. A method of producing transgenic plants by introducing a recombinant
nucleotide sequence into a genome of said plant, characterized in that said nucleotide
sequence introduced into said genome includes SEQ ID NO:1, SEQ ID NO:3 or SEQ
ID NO:23; or a part of SEQ ID NO:1, SEQ ID NO:3 or SEQ ID NO:23; or a sequence
that is substantially homologous to SEQ ID NO:1, SEQ ID NO:3 or SEQ ID NO:23.

20. A method according to claim 19, characterized in that said plant is a
member of the Brassicaceae.

21. A method according to claim 19, characterized in that said plant is
selected from the group consisting of Arabidopsis thaliana, borage (Borago spp.),
Canola, castor (Ricinus communis), cocoa bean (Theobroma cacao), corn (Zea mays),
cotton (Gossypium spp.), Crambe spp., Cuphea spp., flax (Linum spp.), Lesquerella
and Limnanthes spp., Linola, nasturtium (Tropaeolum spp.), Oenothera spp., olive
(Olea spp.), palm (Elaeis spp.), peanut (Arachis spp.), rapeseed, safflower (Carthamus
spp.), soybean (Glycine and Soja spp.), sunflower (Helianthus spp.), tobacco
(Nicotiana spp.), Vernonia spp., wheat (Triticum spp.), barley (Hordeum spp.), rice
(Oryza spp.), oat (Avena spp.), sorghum (Sorghum spp.), rye (Secale spp.) and other
members of the Gramineae.

22. An isolated plant DNA sequence or part thereof, characterized in that
the sequence is substantially homologous to at least a part of SEQ ID NO:1 or SEQ ID
NO:3.

23. A method of changing the oil content, acyl composition or
diacylglycerol/triacylglycerol proportions of the seed oil of plant seeds by introducing a sense or
anti-sense recombinant nucleic acid construct into a plant transformation vector, using the
vector to transform the genome of a plant or plant seed, and then growing the plant or
plant seed and extracting the oil from the plant seed, characterized in that said
recombinant nucleic acid sequence comprises SEQ ID NO:1, SEQ ID NO:3 or SEQ ID
NO:23; or a part of SEQ ID NO:1, SEQ ID NO:3 or SEQ ID NO:23; or a sequence that
is substantially homologous to SEQ ID NO:1, SEQ ID NO:3 or SEQ ID NO:23.

<400> 1

atttcttagc ttcttccttc aatccgctct ttccctctcc attagattct gtttcctctt 60 
tcaatttctt ctgcatgctt ctcgattctc tctgacgcct cttttctccc gacgctgttt 120 
cgtcaaacgc ttttcgaaat ggcgattttg gattctgctg gcgttactac ggtgacggag 180 
aacggtggcg gagagttcgt cgatcttgat aggcttcgtc gacggaaatc gagatcggat 240
tcttctaacg gacttcttct ctctggttcc gataataatt ctccttcgga tgatgttgga 300 
gctcccgccg acgttaggga tcggattgat tccgttgtta acgatgacgc tcagggaaca 360
gccaatttgg ccggagataa taacggtggt ggcgataata acggtggtgg aagaggcggc 420 
ggagaaggaa gaggaaacgc cgatgctacg tttacgtatc gaccgtcggt tccagctcat 480 
cggagggcga gagagagtcc acttagctcc gacgcaatct tcaaacagag ccatgccgga 540
ttattcaacc tctgtgtagt agttcttatt gctgtaaaca gtagactcat catcgaaaat 600 
cttatgaagt atggttggtt gatcagaacg gatttctggt ttagttcaag atcgctgcga 660
gattggccgc ttttcatgtg ttgtatatcc ctttcgatct ttcctttggc tgcctttacg 720 
gttgagaaat tggtacttca gaaatacata tcagaacctg ttgtcatctt tcttcatatt 780 
attatcacca tgacagaggt tttgtatcca gtttacgtca ccctaaggtg tgattctgct 840 
tttttatcag gtgtcacttt gatgctcctc acttgcattg tgtggctaaa gttggtttct 900 
tatgctcata ctagctatga cataagatcc ctagccaatg cagctgataa ggccaatcct 960
gaagtctcct actacgttag cttgaagagc ttggcatatt tcatggtcgc tcccacattg 1020
tgttatcagc caagttatcc acgttctgca tgtatacgga agggttgggt ggctcgtcaa 1080 
tttgcaaaac tggtcatatt caccggattc atgggattta taatagaaca atatataaat 1140 
cctattgtca ggaactcaaa gcatcctttg aaaggcgatc ttctatatgc tattgaaaga 1200 
gtgttgaagc tttcagttcc aaatttatat gtgtggctct gcatgttcta ctgcttcttc 1260 
cacctttggt taaacatatt ggcagagctt ctctgcttcg gggatcgtga attctacaaa 1320 
gattggtgga atgcaaaaag tgtgggagat tactggagaa tgtggaatat gcctgttcat 1380
aaatggatgg ttcgacatat atacttcccg tgcttgcgca gcaagatacc aaagacactc 1440 
gccattatca ttgctttcct agtctctgca gtctttcatg agctatgcat cgcagttcct 1500 
tgtcgtctct tcaagctatg ggcttttctt gggattatgt ttcaggtgcc tttggtcttc 1560 
atcacaaact atctacagga aaggtttggc tcaacggtgg ggaacatgat cttctggttc 1620
atcttctgca ttttcggaca accgatgtgt gtgcttcttt attaccacga cctgatgaac 1680 
cgaaaaggat cgatgtcatg aaacaactgt tcaaaaaatg actttcttca aacatctatg 1740 
gcctcgttgg atctccgttg atgttgtggt ggttctgatg ctaaaacgac aaatagtgtt 1800
ataaccattg aagaagaaaa gaaaattaga gttgttgtat ctgcaaaaat tttggtagag 1860 
acacgcaaac ccgtttggat tttgttatgg tgtaaagcgg ccgc 1904

<210> 2 
<211> 520 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 2 

Met Ala Ile Leu Asp Ser Ala Gly Val Thr Thr Val Thr Glu Asn Gly
1 5 10 15

Gly Gly Glu Phe Val Asp Leu Asp Arg Leu Arg Arg Arg Lys Ser Arg
20 25 30

Ser Asp Ser Ser Asn Gly Leu Leu Leu Ser Gly Ser Asp Asn Asn Ser
35 40 45

Pro Ser Asp Asp Val Gly Ala Pro Ala Asp Val Arg Asp Arg Ile Asp
50 55 60

Ser Val Val Asn Asp Asp Ala Gln Gly Thr Ala Asn Leu Ala Gly Asp
65 70 75 80

Asn Asn Gly Gly Gly Asp Asn Asn Gly Gly Gly Arg Gly Gly Gly Glu
85 90 95

Gly Arg Gly Asn Ala Asp Ala Thr Phe Thr Tyr Arg Pro Ser Val Pro
100 105 110

Ala His Arg Arg Ala Arg Glu Ser Pro Leu Ser Ser Asp Ala Ile Phe
115 120 125

Lys Gln Ser His Ala Gly Leu Phe Asn Leu Cys Val Val Val Leu Ile
130 135 140

Ala Val Asn Ser Arg Leu Ile Ile Glu Asn Leu Met Lys Tyr Gly Trp
145 150 155 160

Leu Ile Arg Thr Asp Phe Trp Phe Ser Ser Arg Ser Leu Arg Asp Trp
165 170 175

Pro Leu Phe Met Cys Cys Ile Ser Leu Ser Ile Phe Pro Leu Ala Ala
180 185 190

Phe Thr Val Glu Lys Leu Val Leu Gln Lys Tyr Ile Ser Glu Pro Val
195 200 205

Val Ile Phe Leu His Ile Ile Ile Thr Met Thr Glu Val Leu Tyr Pro
210 215 220

Val Tyr Val Thr Leu Arg Cys Asp Ser Ala Phe Leu Ser Gly Val Thr
225 230 235 240

Leu Met Leu Leu Thr Cys Ile Val Trp Leu Lys Leu Val Ser Tyr Ala
245 250 255

His Thr Ser Tyr Asp Ile Arg Ser Leu Ala Asn Ala Ala Asp Lys Ala
260 265 270

Asn Pro Glu Val Ser Tyr Tyr Val Ser Leu Lys Ser Leu Ala Tyr Phe
275 280 285

Met Val Ala Pro Thr Leu Cys Tyr Gln Pro Ser Tyr Pro Arg Ser Ala
290 295 300

Cys Ile Arg Lys Gly Trp Val Ala Arg Gln Phe Ala Lys Leu Val Ile
305 310 315 320

Phe Thr Gly Phe Met Gly Phe Ile Ile Glu Gln Tyr Ile Asn Pro Ile
325 330 335

Val Arg Asn Ser Lys His Pro Leu Lys Gly Asp Leu Leu Tyr Ala Ile
340 345 350

Glu Arg Val Leu Lys Leu Ser Val Pro Asn Leu Tyr Val Trp Leu Cys
355 360 365

Met Phe Tyr Cys Phe Phe His Leu Trp Leu Asn Ile Leu Ala Glu Leu
370 375 380

Leu Cys Phe Gly Asp Arg Glu Phe Tyr Lys Asp Trp Trp Asn Ala Lys
385 390 395 400

Ser Val Gly Asp Tyr Trp Arg Met Trp Asn Met Pro Val His Lys Trp
405 410 415

Met Val Arg His Ile Tyr Phe Pro Cys Leu Arg Ser Lys Ile Pro Lys
420 425 430

Thr Leu Ala Ile Ile Ile Ala Phe Leu Val Ser Ala Val Phe His Glu
435 440 445

Leu Cys Ile Ala Val Pro Cys Arg Leu Phe Lys Leu Trp Ala Phe Leu
450 455 460

Gly Ile Met Phe Gln Val Pro Leu Val Phe Ile Thr Asn Tyr Leu Gln
465 470 475 480

Glu Arg Phe Gly Ser Thr Val Gly Asn Met Ile Phe Trp Phe Ile Phe
485 490 495

Cys Ile Phe Gly Gln Pro Met Cys Val Leu Leu Tyr Tyr His Asp Leu
500 505 510

Met Asn Arg Lys Gly Ser Met Ser
515 520



<210> 3 
<211> 5193 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 3 

gctcacgacc cattcttccc gttccatttg gttttattta tttcaaagtt taatattcct 60 
tttgtataac attcaaatct tcacatgatt gattgtgtga aaaccccaca gattttacta 120
caataggggg agttgactta aaatagctat tgatgtcgaa aaaatgtatt ttagttataa 180
attatactaa agaaaatttt tgatttgtct gttgtttaag catatgtatt gttaaactta 240 
aaaaaatatg tattgttaat cttaaaaatg taggagtaca catcaaatac tcgagcataa 300 
tcaaaaccgt attcatagac cgatgtgaga atcaaataga agataatgtg attttttaaa 360 
atatcgtatc tccaaatcaa tcacttagaa gataatgtaa ttctttatgt gctacataaa 420 
taaatatata tatatatata tatatatatc ttgtatatat gtcttgacaa aaaattgcca 480 
gtcaaaaacc atgactgaat caaactataa gtcggattga atcaaactat aagtcggatg 540
agtattaatt tccattatgt ttctatactt tacaaaccgg aaaatagata ttatagatac 600 
caaaaaagta gatttgtgta tattattaga agatttggaa tttcatcatt atcaggatct 660 
aaagtacttc cctaattaaa tcatgtcggt tgaaaaagct caatgaatgt ttgaaatttg 720
gaaagtttat taaattcgga tctttttttt ttgtttgtcg tcccaaacat ttttatttta 780 
ttacaaataa tcaacttatc cttactacta aatcatttca tatctttgat accaacaaat 840 
catttcatat tctattttga tgtttaagaa aacactattt accagttaca aaatattata 900 
aggattgttg tttagaaaaa aaagtacaag ttgaattctt tttgtcaaat ataaaattga 960 
ctttttaata tataattgac ttattgaaca tgattacaga attaatcatc tacaaaactt 1020 
tccaagttta taataaatac atttcaaaga ctattagttc ttcttaaaat atttctaaaa 1080 
gtgatcaaag actaccacat ataattcaga aaaagtagaa gttgatttct ttttgtcaaa 1140 
taaataattg acttaaaata gtttggaaag ccattgaact tgattataga attgataatg 1200 
tacataaaaa aattccaagt ttataataaa tacatttttc aaatgctata tcagttcttc 1260 
ttaaaatatt tcactaaaaa aacactcaaa tatagaataa atttattgaa taacatacca 1320 
actgtaaaac agaatttgac aaaaaaaaaa aaaaaatgaa atgaagatga agacaaaaat 1380
aaatcaccag aggatcttat gcaaaaaaat atatgaatac acaataaacc atattgatat 1440 
ttttaaaata aaataaaaac agaaaaatat cccaacaccg cttttcaatt aaaaatcttc 1500 
cgtcaccatt gttgtcatct tcctctctcg tgaatccttt ttcctttctt cttcttcttc 1560 
tcttcagaga aaactttgct tctctttcta taaggaacca gacacgaatc ccattcccac 1620 
cgatttctta gcttcttcct tcaatccgct ctttccctct ccattagatt ctgtttcctc 1680 
tttcaatttc ttctgcatgc ttctcgattc tctctgacgc ctcttttctc ccgacgctgt 1740 
ttcgtcaaac gcttttcgaa atggcgattt tggattctgc tggcgttact acggtgacgg 1800 
agaacggtgg cggagagttc gtcgatcttg ataggcttcg tcgacggaaa tcgagatcgg 1860 
attcttctaa cggacttctt ctctctggtt ccgataataa ttctccttcg gatgatgttg 1920 
gagctcccgc cgacgttagg gatcggattg attccgttgt taacgatgac gctcagggaa 1980 
cagccaattt ggccggagat aataacggtg gtggcgataa taacggtggt ggaagaggcg 2040
gcggagaagg aagaggaaac gccgatgcta cgtttacgta tcgaccgtcg gttccagctc 2100 
atcggagggc gagagagagt ccacttagct ccgacgcaat cttcaaacag gtttaaaatc 2160
tcagaaatct tcgaatttgg tgtttgcttg ttgttttata tggaattgag tttggtgatt 2220 
gttttgcatt gcagagccat gccggattat tcaacctctg tgtagtagtt cttattgctg 2280 
taaacagtag actcatcatc gaaaatctta tgaaggtttg ctgttacttg tttctccttt 2340 
taggaattga attgcttgaa aatttatcag agacgaataa ctttgttgtt gctatcattc 2400 
atgtagtatg gttggttgat cagaacggat ttctggttta gttcaagatc gctgcgagat 2460
tggccgcttt tcatgtgttg gtaaaagaag atgtttttta tttccagcaa tgttacattg 2520 
ttatacgtat aatgatgagt ttagtgatca agttcctctt tgattcttct ttcttgttgc 2580 
agtatatccc tttcgatctt tcctttggct gcctttacgg ttgagaaatt ggtacttcag 2640 
aaatacatat cagaacctgt gagtaattac tattctccag ccattactgt aatttttatt 2700 
gaagacaagt ttgtatcatg aagaacttac aagttctgtt ttgaaaatgc tcaaggttgt 2760 
catctttctt catattatta tcaccatgac agaggttttg tatccagttt acgtcaccct 2820 
aaggtgatac tgtttttctg gtctcagttt gtgatactgt ttttaagttt agttgtctga 2880 
cccggtgatc ttgaaaatgg acaggtgtga ttctgctttt ttatcaggtg tcactttgat 2940 
gctcctcact tgcattgtgt ggctaaagtt ggtttcttat gctcatacta gctatgacat 3000
aagatcccta gccaatgcag ctgataaggt aaaatacgaa aaagaagcgt atgtattagt 3060 
cacttgcact gtgttactgt tttaaccaaa cactgttatg aactttaggc caatcctgaa 3120
gtctcctact acgttagctt gaagagcttg gcatatttca tggtcgctcc cacattgtgt 3180
tatcaggtaa ctgcaaagtg catcaaccat tcttatactt gcaagagttt cttgtctaaa 3240 
cctcggatct ttgcttttcc ccagccaagt tatccacgtt ctgcatgtat acggaagggt 3300 
tgggtggctc gtcaatttgc aaaactggtc atattcaccg gattcatggg atttataata 3360 
gaacaagtac gttttcacat cttgctttat tagttttcct tggtgaaaat catcatccct 3420 
gcgttgtcac cacttgactt catgttcttt tgttacattt tggcagtata taaatcctat 3480 
tgtcaggaac tcaaagcatc ctttgaaagg cgatcttcta tatgctattg aaagagtgtt 3540 
gaagctttca gttccaaatt tatatgtgtg gctctgcatg ttctactgct tcttccacct 3600 
ttggtatgct gtgatcccat ctctttcaaa ataatttgca aattcgaaaa accgaaaaag 3660
gctaaatctc atacgaattt gatattttta gtttcttaga gtcggtgatg taatttcagt 3720
tactgaacgc aaatctcttg tccaaaggtt aaacatattg gcagagcttc tctgcttcgg 3780
ggatcgtgaa ttctacaaag attggtggaa tgcaaaaagt gtgggagatg tgagctattt 3840
tactcaaaag aaaacttatg atttttaatg ttgtcgttgt ttttgggtca tctaactaac 3900
caaattcatg tattcactgt cttcctttat cagtactgga gaatgtggaa tatggtatgg 3960
ttctcttcct aaacatcacc ttcttttgta cacaaaatag aagaagagag ctaattaaga 4020
tcttgttttc cttgacagcc tgttcataaa tggatggttc gacatatata cttcccgtgc 4080
ttgcgcagca agataccaaa ggtgagtgag atatataccg atatgcaatt gtcgagattt 4140
gtttctgtga tataaattta accctccaca cacttgtttt tcagacactc gccattatca 4200
ttgctttcct agtctctgca gtctttcatg aggtatacat actttctaca ttgccctgtc 4260
tctagacgca tgaacacacg ctagtgaaag aaatgctaat attcaaagca ttgtttttac 4320
ttaacgatct tgtgttacaa atttcctttt gacagctatg catcgcagtt ccttgtcgtc 4380
tcttcaagct atgggctttt cttgggatta tgtttcaggt taaaaaatta ctaaactgct 4440
gcagtcgatt tttactaaac tctaatctca tattctgacc aaccaatttg tttgagtagg 4500
tgcctttggt cttcatcaca aactatctac aggaaaggtt tggctcaacg gtatgctctc 4560
aaaacccgag aaaatagaac gaataactct ttctttcata gcctagccat ttaaatcgca 4620
atgctgaaac ttaataataa aggtgatctg ttttggaatg ggatcatatt attaggtggg 4680
gaacatgatc ttctggttca tcttctgcat tttcggacaa ccgatgtgtg tgcttcttta 4740
ttaccacgac ctgatgaacc gaaaaggatc gatgtcatga aacaactgtt caaaaaatga 4800
ctttcttcaa acatctatgg cctcgttgga tctccgttga tgttgtggtg gttctgatgc 4860
taaaacgaca aatagtgtta taaccattga agaagaaaag aaaattagag ttgttgtatc 4920
tgcaaaaatt ttggtagaga cacgcgaacc cgtttggatt ttgttatggt gtaaagaaat 4980
ttcaatcaaa aaactgttgt aataattgtt accaaaaaga aatgcttttc tggaaacgag 5040
gggaaaaata gtagttttgt taggttttac tgtttggacc aaatctagta aaaaactttt 5100
tgtaataagg aaaaaaaaag aacaaatgtg ataaatgcat ggggattgta tgaaaccttc 5160
caataaagtt gattggtggt cccgttttgg gga 5193



<210> 4 
<211> 498 
<212> PRT 
<213> mouse 

<400> 4 

Met Gly Asp Arg Gly Gly Ala Gly Ser Ser Arg Arg Arg Arg Thr Gly
1 5 10 15

Ser Arg Val Ser Val Gln Gly Gly Ser Gly Pro Lys Val Glu Glu Asp
20 25 30

Glu Val Arg Asp Ala Ala Val Ser Pro Asp Leu Gly Ala Gly Gly Asp
35 40 45

Ala Pro Ala Pro Ala Pro Ala Pro Ala His Thr Arg Asp Lys Asp Gly
50 55 60

Arg Thr Ser Val Gly Asp Gly Tyr Trp Asp Leu Arg Cys His Arg Leu
65 70 75 80

Gln Asp Ser Leu Phe Ser Ser Asp Ser Gly Phe Ser Asn Tyr Arg Gly
85 90 95

Ile Leu Asn Trp Cys Val Val Met Leu Ile Leu Ser Asn Ala Arg Leu
100 105 110

Phe Leu Glu Asn Leu Ile Lys Tyr Gly Ile Leu Val Asp Pro Ile Gln
115 120 125

Val Val Ser Leu Phe Leu Lys Asp Pro Tyr Ser Trp Pro Ala Pro Cys
130 135 140

Val Ile Ile Ala Ser Asn Ile Phe Val Val Ala Ala Phe Gln Ile Glu
145 150 155 160

Lys Arg Leu Ala Val Gly Ala Leu Thr Glu Gln Met Gly Leu Leu Leu
165 170 175

His Val Val Asn Leu Ala Thr Ile Ile Cys Phe Pro Ala Ala Val Ala
180 185 190

Leu Leu Val Glu Ser Ile Thr Pro Val Gly Ser Val Phe Ala Leu Ala
195 200 205

Ser Tyr Ser Ile Met Phe Leu Lys Leu Tyr Ser Tyr Arg Asp Val Asn
210 215 220

Leu Trp Cys Arg Gln Arg Arg Val Lys Ala Lys Ala Val Ser Thr Gly
225 230 235 240

Lys Lys Val Ser Gly Ala Ala Ala Gln Gln Ala Val Ser Tyr Pro Asp
245 250 255

Asn Leu Thr Tyr Arg Asp Leu Tyr Tyr Phe Ile Phe Ala Pro Thr Leu
260 265 270

Cys Tyr Glu Leu Asn Phe Pro Arg Ser Pro Arg Ile Arg Lys Arg Phe
275 280 285

Leu Leu Arg Arg Val Leu Glu Met Leu Phe Phe Thr Gln Leu Gln Val
290 295 300

Gly Leu Ile Gln Gln Trp Met Val Pro Thr Ile Gln Asn Ser Met Lys
305 310 315 320

Pro Phe Lys Asp Met Asp Tyr Ser Arg Ile Ile Glu Arg Leu Leu Lys
325 330 335

Leu Ala Val Pro Asn His Leu Ile Trp Leu Ile Phe Phe Tyr Trp Phe
340 345 350

Phe His Ser Cys Leu Asn Ala Val Ala Glu Leu Leu Gln Phe Gly Asp
355 360 365

Arg Glu Phe Tyr Arg Asp Trp Trp Asn Ala Glu Ser Val Thr Tyr Phe
370 375 380

Trp Gln Asn Trp Asn Ile Pro Val His Lys Trp Cys Ile Arg His Phe
385 390 395 400

Tyr Lys Pro Met Leu Arg His Gly Ser Ser Lys Trp Val Ala Arg Thr
405 410 415

Gly Val Phe Leu Thr Ser Ala Phe Phe His Glu Tyr Leu Val Ser Val
420 425 430

Pro Leu Arg Met Phe Arg Leu Trp Ala Phe Thr Ala Met Met Ala Gln
435 440 445

Val Pro Leu Ala Trp Ile Val Gly Arg Phe Phe Gln Gly Asn Tyr Gly
450 455 460

Asn Ala Ala Val Trp Val Thr Leu Ile Ile Gly Gln Pro Val Ala Val
465 470 475 480

Leu Met Tyr Val His Asp Tyr Tyr Val Leu Asn Tyr Asp Ala Pro Val
485 490 495

Gly Val



<210> 5 
<211> 488 
<212> PRT 
<213> human 



<400> 5 

Met Gly Asp Arg Gly Ser Ser Arg Arg Arg Arg Thr Gly Ser Arg Pro
1 5 10 15

Ser Ser His Gly Gly Gly Gly Pro Ala Ala Ala Glu Glu Glu Val Arg
20 25 30

Asp Ala Ala Ala Gly Pro Asp Val Gly Ala Ala Gly Asp Ala Pro Ala
35 40 45

Pro Ala Pro Asn Lys Asp Gly Asp Ala Gly Val Gly Ser Gly His Trp
50 55 60

Glu Leu Arg Cys His Arg Leu Gln Asp Ser Leu Phe Ser Ser Asp Ser
65 70 75 80

Gly Phe Ser Asn Tyr Arg Gly Ile Leu Asn Trp Cys Val Val Met Leu
85 90 95

Ile Leu Ser Asn Ala Arg Leu Phe Leu Glu Asn Leu Ile Lys Tyr Gly
100 105 110

Ile Leu Val Asp Pro Ile Gln Val Val Ser Leu Phe Leu Lys Asp Pro
115 120 125

His Ser Trp Pro Ala Pro Cys Leu Val Ile Ala Ala Asn Val Phe Ala
130 135 140

Val Ala Ala Phe Gln Val Glu Lys Arg Leu Ala Val Gly Ala Leu Thr
145 150 155 160

Glu Gln Ala Gly Leu Leu Leu His Val Ala Asn Leu Ala Thr Ile Leu
165 170 175

Cys Phe Pro Ala Ala Val Val Leu Leu Val Glu Ser Ile Thr Pro Val
180 185 190

Gly Ser Leu Leu Ala Leu Met Ala His Thr Ile Leu Phe Leu Lys Leu
195 200 205

Phe Ser Tyr Arg Asp Val Asn Ser Trp Cys Arg Arg Ala Arg Ala Lys
210 215 220

Ala Ala Ser Ala Gly Lys Lys Ala Ser Ser Ala Ala Ala Pro His Thr
225 230 235 240

Val Ser Tyr Pro Asp Asn Leu Thr Tyr Arg Asp Leu Tyr Tyr Phe Leu
245 250 255

Phe Ala Pro Thr Leu Cys Tyr Glu Leu Asn Phe Pro Arg Ser Pro Arg
260 265 270

Ile Arg Lys Arg Phe Leu Leu Arg Arg Ile Leu Glu Met Leu Phe Phe
275 280 285

Thr Gln Leu Gln Val Gly Leu Ile Gln Gln Trp Met Val Pro Thr Ile
290 295 300

Gln Asn Ser Met Lys Pro Phe Lys Asp Met Asp Tyr Ser Arg Ile Ile
305 310 315 320

Glu Arg Leu Leu Lys Leu Ala Val Pro Asn His Leu Ile Trp Leu Ile
325 330 335

Phe Phe Tyr Trp Leu Phe His Ser Cys Leu Asn Ala Val Ala Glu Leu
340 345 350

Met Gln Phe Gly Asp Arg Glu Phe Tyr Arg Asp Trp Trp Asn Ser Glu
355 360 365

Ser Val Thr Tyr Phe Trp Gln Asn Trp Asn Ile Pro Val His Lys Trp
370 375 380

Cys Ile Arg His Phe Tyr Lys Pro Met Leu Arg Arg Gly Ser Ser Lys
385 390 395 400

Trp Met Ala Arg Thr Gly Val Phe Leu Ala Ser Ala Phe Phe His Glu
405 410 415

Tyr Leu Val Ser Val Pro Leu Arg Met Phe Arg Leu Trp Ala Phe Thr
420 425 430

Gly Met Met Ala Gln Ile Pro Leu Ala Trp Phe Val Gly Arg Phe Phe
435 440 445

Gln Gly Asn Tyr Gly Asn Ala Ala Val Trp Leu Ser Leu Ile Ile Gly
450 455 460

Gln Pro Ile Ala Val Leu Met Tyr Val His Asp Tyr Tyr Val Leu Asn
465 470 475 480

Tyr Glu Ala Pro Ala Ala Glu Ala
485



<210> 6
<211> 629
<212> DNA
<213> Arabidopsis thaliana

<220>
<221> misc_feature
<222> (455)..(455)
<223> n is any nucleotide

<220>
<221> misc_feature
<222> (464)..(464)
<223> n is any nucleotide

<220>
<221> misc_feature
<222> (467)..(467)
<223> n is any nucleotide

<220>
<221> misc_feature
<222> (475)..(475)
<223> n is any nucleotide

<220>
<221> misc_feature
<222> (497)..(497)
<223> n is any nucleotide

<220>
<221> misc_feature
<222> (500)..(500)
<223> n is any nucleotide

<220>
<221> misc_feature
<222> (508)..(508)
<223> n is any nucleotide

<220>
<221> misc_feature
<222> (514)..(514)
<223> n is any nucleotide

<220>
<221> misc_feature
<222> (519)..(519)
<223> n is any nucleotide

<220>
<221> misc_feature
<222> (536)..(536)
<223> n is any nucleotide

<220>
<221> misc_feature
<222> (543)..(544)
<223> n is any nucleotide

<220>
<221> misc_feature
<222> (576)..(576)
<223> n is any nucleotide

<220>
<221> misc_feature
<222> (583)..(584)
<223> n is any nucleotide

<220>
<221> misc_feature
<222> (597)..(597)
<223> n is any nucleotide

<400> 6 

tgcatgtata cggaagggtt gggtggctcg tcaatttgca aaactggtca tattcaccgg 60

attcatggga tttataatag aacaatatat aaatcctatt gtcaggaact caaagcatcc 120

tttgaaaggc gatcttctat atgctattga aagagtgttg aagctttcag ttccaaattt 180

atatgtgtgg ctctgcatgt tctactgctt cttccacctt tggttaaaca tattggcaga 240

gcttctctgc ttcggggatc gtgaattcta caaagattgg tggaatgcaa aaagtgtggg 300

agattactgg gagaatgtgg aatatgcctg tccataaatg ggatgggtcc gacatatata 360

ccttccccgt gcttgcgcac aaggattacc caaagacacc ccggccatta accattggct 420

ttcccaagcc ccctggaggc ctttccatgg gccanggacc cggngtnccc tggcnggccc 480

ttcaaagcaa agggggnttn cctggggnta aagntccang ggcccttggg gcccanccaa 540

aannttcccc cgggaaaggg ttgcccaccg gggggngaaa aanncccggg ggcaccncgg 600

aattttggga acccgggggg ggccttttt 629

<210> 7 
<211> 11 
<212> PRT 

<213> Arabidopsis thaliana
<400> 7 

His Lys Trp Met Val Arg His Ile Tyr Phe Pro
1 5 10



<210> 8 
<211> 341 
<212> PRT 

<213> Brassica napus 



<400> 8 

Met Cys Cys Leu Ser Leu Ser Ile Phe Pro Leu Ala Ala Phe Thr Val
1 5 10 15

Glu Lys Leu Val Leu Gln Lys Cys Ile Ser Glu Pro Val Val Ile Phe
20 25 30

Leu His Val Ile Ile Thr Met Thr Glu Val Leu Tyr Pro Val Tyr Val
35 40 45

Thr Leu Arg Cys Asp Ser Ala Phe Leu Ser Gly Asp Thr Leu Met Leu
50 55 60

Leu Thr Cys Ile Val Trp Leu Lys Leu Val Ser Tyr Ala His Thr Asn
65 70 75 80

Tyr Asp Ile Arg Thr Leu Ala Asn Ser Ser Asp Lys Ala Asn Pro Glu
85 90 95

Val Ser Tyr Tyr Val Ser Leu Lys Ser Leu Ala Tyr Phe Met Leu Ala
100 105 110

Pro Thr Leu Cys Tyr Gln Pro Ser Tyr Pro Arg Ser Pro Cys Ile Arg
115 120 125

Lys Gly Trp Val Ala Arg Gln Phe Ala Lys Leu Val Ile Phe Thr Gly
130 135 140

Leu Met Gly Phe Ile Ile Glu Gln Tyr Ile Asn Pro Ile Val Arg Asn
145 150 155 160

Ser Lys His Pro Leu Lys Gly Asp Leu Leu Tyr Ala Ile Glu Arg Val
165 170 175

Leu Lys Leu Ser Val Pro Asn Leu Tyr Val Trp Leu Cys Met Phe Tyr
180 185 190

Cys Phe Phe His Leu Trp Leu Asn Ile Leu Ala Glu Leu Leu Cys Phe
195 200 205

Gly Asp Arg Glu Phe Tyr Lys Asp Trp Trp Asn Ala Lys Ser Val Gly
210 215 220

Asp Tyr Trp Arg Met Trp Asn Met Pro Val His Lys Trp Met Val Arg
225 230 235 240

His Val Tyr Phe Pro Cys Leu Arg Ile Lys Ile Pro Lys Val Pro Ala
245 250 255

Ile Ile Ile Ala Phe Leu Val Ser Ala Val Phe His Glu Leu Cys Ile
260 265 270

Ala Val Pro Cys Arg Leu Phe Asn Leu Trp Ala Phe Met Gly Ile Met
275 280 285

Phe Gln Val Pro Leu Val Phe Ile Thr Asn Phe Leu Gln Glu Arg Phe
290 295 300

Gly Ser Met Val Gly Asn Met Ile Phe Gly Ser Ala Ser Cys Ile Phe
305 310 315 320

Gly Gln Pro Met Cys Gly Leu Leu Tyr Tyr His Asp Leu Met Asn Arg
325 330 335

Lys Gly Ser Met Ser
340



<210> 9 
<211> 503 
<212> PRT 

<213> Brassica napus 
<400> 9 

Met Ala Ile Leu Asp Ser Gly Gly Val Ala Val Pro Pro Thr Glu Asn
1 5 10 15

Gly Val Ala Asp Leu Asp Arg Leu His Arg Arg Lys Ser Ser Ser Asp
20 25 30

Ser Ser Asn Gly Leu Leu Ser Asp Thr Ser Pro Ser Asp Asp Val Gly
35 40 45

Ala Ala Ala Ala Glu Arg Asp Arg Val Asp Ser Ala Ala Glu Glu Glu
50 55 60

Ala Gln Gly Thr Ala Asn Leu Ala Gly Gly Asp Ala Glu Thr Arg Glu
65 70 75 80

Ser Ala Gly Gly Asp Val Arg Phe Thr Tyr Arg Pro Ser Val Pro Ala
85 90 95

His Arg Arg Thr Arg Glu Ser Pro Leu Ser Ser Asp Ala Ile Phe Lys
100 105 110

Gln Ser His Ala Gly Leu Phe Asn Leu Cys Val Val Val Leu Val Ala
115 120 125

Val Asn Ser Arg Leu Ile Ile Glu Asn Leu Met Lys Tyr Gly Trp Leu
130 135 140

Ile Arg Thr Asp Phe Trp Phe Ser Ser Thr Ser Leu Arg Asp Trp Pro
145 150 155 160

Leu Phe Met Cys Cys Leu Ser Leu Ser Val Phe Pro Leu Ala Ala Phe
165 170 175

Thr Val Glu Lys Met Val Leu Gln Lys Phe Ile Ser Glu Pro Val Ala
180 185 190

Ile Ile Leu His Val Ile Ile Thr Met Thr Glu Val Leu Tyr Pro Val
195 200 205

Tyr Val Thr Leu Arg Cys Asp Ser Ala Phe Leu Ser Gly Val Thr Leu
210 215 220

Met Leu Leu Thr Cys Ile Val Trp Leu Lys Leu Val Ser Tyr Ala His
225 230 235 240

Thr Ser Tyr Asp Ile Arg Thr Leu Ala Asn Ser Ala Asp Lys Val Asp
245 250 255

Pro Glu Ile Ser Tyr Tyr Val Ser Leu Lys Ser Leu Ala Tyr Phe Met
260 265 270

Val Ala Pro Thr Leu Cys Tyr Gln Pro Ser Tyr Pro Arg Ser Pro Cys
275 280 285

Ile Arg Lys Gly Trp Val Ala Arg Gln Leu Ala Lys Leu Val Ile Phe
290 295 300

Thr Gly Leu Met Gly Phe Ile Ile Glu Gln Tyr Ile Asn Pro Ile Val
305 310 315 320

Arg Asn Ser Lys His Pro Leu Lys Gly Asp Leu Leu Tyr Ala Ile Glu
325 330 335

Arg Val Leu Lys Leu Ser Val Pro Asn Leu Tyr Val Trp Leu Cys Met
340 345 350

Phe Tyr Cys Phe Phe His Leu Trp Leu Asn Ile Leu Ala Glu Leu Leu
355 360 365

Cys Phe Gly Asp Arg Glu Phe Tyr Lys Asp Trp Trp Asn Ala Lys Ser
370 375 380

Val Gly Asp Tyr Trp Arg Met Trp Asn Met Pro Val His Lys Trp Met
385 390 395 400

Val Arg His Val Tyr Phe Pro Cys Leu Arg Ile Lys Ile Pro Lys Val
405 410 415

Pro Ala Ile Ile Ile Ala Phe Leu Val Ser Ala Val Phe His Glu Leu
420 425 430

Cys Ile Ala Val Pro Cys Arg Leu Phe Asn Leu Trp Ala Phe Met Gly
435 440 445

Ile Met Phe Gln Val Pro Leu Val Phe Ile Thr Asn Phe Leu Gln Glu
450 455 460

Arg Phe Gly Ser Met Val Gly Asn Met Ile Phe Gly Ser Ala Ser Cys
465 470 475 480

Ile Phe Gly Gln Pro Met Cys Gly Leu Leu Tyr Tyr His Asp Leu Met
485 490 495

Asn Arg Lys Gly Ser Met Ser
500



<210> 10 

<211> 11 

<212> PRT 

<213> Arabidopsis thaliana 
<220> 

<221> misc_feature

<222> (1)..(1)

<223> Xaa is any amino acid

<220>

<221> misc_feature

<222> (3)..(3)

<223> Xaa is any amino acid

<220>

<221> misc_feature

<222> (5)..(6)

<223> Xaa is any amino acid

<220>

<221> misc_feature

<222> (8)..(10)

<223> Xaa is any amino acid

<400> 10

Xaa Leu Xaa Leu Xaa Xaa Ser Xaa Xaa Xaa Val
1 5 10



<210> 11 
<211> 27 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 11 

Ser His Ala Gly Leu Phe Asn Leu Cys Val Val Val Leu Ile Ala Val
1 5 10 15

Asn Ser Arg Leu Ile Ile Glu Asn Leu Met Lys
20 25



<210> 12 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer of
DGATXbaI

<400> 12 

ctagtctaga atggcgattt tgga 24 



<210> 13 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer of
DGATXhoI

<400> 13

gcgctcgagt ttcatgacat cga 23

<210> 14 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer DGAT1 
<400> 14 

agacacgaat cccattccca ccga 24

<210> 15 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer DGAT2 
<400> 15 

agtggtgaca acgcagggat gatg 24

<210> 16 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer DGAT3 

<400> 16 

atggtcgctc ccacattgtg t 21

<210> 17 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer DGAT4 
<400> 17 

catacaatcc ccatgacatt tatca 25

<210> 18 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer A 



<400> 18

cgaccgtcgg ttccagctca tcgg 24



<210> 19 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer B 
<400> 19 

gcggccaatc tcgcagcgat cttg 24 

<210> 20
<211> 21
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Primer C
<400> 20

taaacagtag actcatcatc g 21

<210> 21
<211> 33
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Primer Gen 1
<400> 21

gagaggatcc acgctcacga cccattcttc ccg 33

<210> 22
<211> 33
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Primer Gen 2
<400> 22

aagaaggatc catccccaaa acgggaccac caa 33



<210> 23 

<211> 1985 

<212> DNA 

<213> Arabidopsis thaliana

<400> 23

atttcttagc ttcttccttc aatccgctct ttccctctcc attagattct gtttcctctt 60
tcaatttctt ctgcatgctt ctcgattctc tctgacgcct cttttctccc gacgctgttt 120
cgtcaaacgc ttttcgaaat ggcgattttg gattctgctg gcgttactac ggtgacggag 180
aacggtggcg gagagttcgt cgatcttgat aggcttcgtc gacggaaatc gagatcggat 240
tcttctaacg gacttcttct ctctggttcc gataataatt ctccttcgga tgatgttgga 300
gctcccgccg acgttaggga tcggattgat tccgttgtta acgatgacgc tcagggaaca 360
gccaatttgg ccggagataa taacggtggt ggcgataata acggtggtgg aagaggcggc 420
ggagaaggaa gaggaaacgc cgatgctacg tttacgtatc gaccgtcggt tccagctcat 480
cggagggcga gagagagtcc acttagctcc gacgcaatct tcaaacagag ccatgccgga 540
ttattcaacc tctgtgtagt agttcttatt gctgtaaaca gtagactcat catcgaaaat 600
cttatgaaga gccatgccgg attattcaac ctctgtgtag tagttcttat tgctgtaaac 660
agtagactca tcatcgaaaa tcttatgaag tatggttggt tgatcagaac ggatttctgg 720
tttagttcaa gatcgctgcg agattggccg cttttcatgt gttgtatatc cctttcgatc 780
tttcctttgg ctgcctttac ggttgagaaa ttggtacttc agaaatacat atcagaacct 840
gttgtcatct ttcttcatat tattatcacc atgacagagg ttttgtatcc agtttacgtc 900
accctaaggt gtgattctgc ttttttatca ggtgtcactt tgatgctcct cacttgcatt 960
gtgtggctaa agttggtttc ttatgctcat actagctatg acataagatc cctagccaat 1020
gcagctgata aggccaatcc tgaagtctcc tactacgtta gcttgaagag cttggcatat 1080
ttcatggtcg ctcccacatt gtgttatcag ccaagttatc cacgttctgc atgtatacgg 1140
aagggttggg tggctcgtca atttgcaaaa ctggtcatat tcaccggatt catgggattt 1200
ataatagaac aatatataaa tcctattgtc aggaactcaa agcatccttt gaaaggcgat 1260
cttctatatg ctattgaaag agtgttgaag ctttcagttc caaatttata tgtgtggctc 1320
tgcatgttct actgcttctt ccacctttgg ttaaacatat tggcagagct tctctgcttc 1380
ggggatcgtg aattctacaa agattggtgg aatgcaaaaa gtgtgggaga ttactggaga 1440
atgtggaata tgcctgttca taaatggatg gttcgacata tatacttccc gtgcttgcgc 1500
agcaagatac caaagacact cgccattatc attgctttcc tagtctctgc agtctttcat 1560
gagctatgca tcgcagttcc ttgtcgtctc ttcaagctat gggcttttct tgggattatg 1620
tttcaggtgc ctttggtctt catcacaaac tatctacagg aaaggtttgg ctcaacggtg 1680
gggaacatga tcttctggtt catcttctgc attttcggac aaccgatgtg tgtgcttctt 1740
tattaccacg acctgatgaa ccgaaaagga tcgatgtcat gaaacaactg ttcaaaaaat 1800
gactttcttc aaacatctat ggcctcgttg gatctccgtt gatgttgtgg tggttctgat 1860
gctaaaacga caaatagtgt tataaccatt gaagaagaaa agaaaattag agttgttgta 1920
tctgcaaaaa ttttggtaga gacacgcaaa cccgtttgga ttttgttatg gtgtaaagcg 1980
gccgc 1985



<210> 24 
<211> 5339 
<212> DNA 

<213> Arabidopsis thaliana 



<400> 24 

gctcacgacc 

tttgtataac 

caataggggg 

attatactaa 

aaaaaatatg 

tcaaaaccgt 

atatcgtatc 

taaatatata 

gtcaaaaacc 

agtattaatt 

caaaaaagta 

aaagtacttc 

gaaagtttat 

ttacaaataa 

catttcatat 

aggattgttg 

ctttttaata 

tccaagttta 



cattcttccc 
attcaaatct 
agttgactta 
agaaaatttt 
tattgttaat 
attcatagac 
tccaaatcaa 
tatatatata 
atgactgaat 
tccattatgt 
gatttgtgta 
cctaattaaa 
taaattcgga 
tcaacttatc 
tctattttga 
tttagaaaaa 
tataattgac 
taataaatac 



gttccatttg 
tcacatgatt 
aaatagctat 
tgatttgtct 
cttaaaaatg 
cgatgtgaga 
tcacttagaa 
tatatatatc 
caaactataa 
ttctatactt 
tattattaga 
tcatgtcggt 
tctttttttt 
cttactacta 
tgtttaagaa 
aaagtacaag 
ttattgaaca 
atttcaaaga 



gttttattta 
gattgtgtga 
tgatgtcgaa 
gttgtttaag 
taggagtaca 
atcaaataga 
gataatgtaa 
ttgtatatat 
gtcggattga 
tacaaaccgg 
agatttggaa 
tgaaaaagct 
ttgtttgtcg 
aatcatttca 
aacactattt 
ttgaattctt 
tgattacaga 
ctattagttc 



tttcaaagtt 
aaaccccaca 
aaaatgtatt 
catatgtatt 
catcaaatac 
agataatgtg 
ttctttatgt 
gtcttgacaa 
atcaaactat 
aaaatagata 
tttcatcatt 
caatgaatgt 
tcccaaacat 
tatctttgat 
accagttaca 
tttgtcaaat 
attaatcatc 
ttcttaaaat 



taatattcct 
gattttacta 
ttagttataa 
gttaaactta 
tcgagcataa 
attttttaaa 
gctacataaa 
aaaattgcca 
aagtcggatg 
ttatagatac 
atcaggatct 
ttgaaatttg 
ttttatttta 
accaacaaat 
aaatattata 
ataaaattga 
tacaaaactt 
atttctaaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 
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gtgatcaaag actaccacat ataattcaga 
taaataattg acttaaaata gtttggaaag 
tacataaaaa aattccaagt ttataataaa 
ttaaaatatt tcactaaaaa aacactcaaa 
actgtaaaac agaatttgac aaaaaaaaaa 
aaatcaccag aggatcttat gcaaaaaaat 
ttttaaaata aaataaaaac agaaaaatat 
cgtcaccatt gttgtcatct tcctctctcg 
tcttcagaga aaactttgct tctctttcta 
cgatttctta gcttcttcct tcaatccgct 
tttcaatttc ttctgcatgc ttctcgattc 
ttcgtcaaac gcttttcgaa atggcgattt 
agaacggtgg cggagagttc gtcgatcttg 
attcttctaa cggacttctt ctctctggtt 
gagctcccgc cgacgttagg gatcggattg 
cagccaattt ggccggagat aataacggtg 
gcggagaagg aagaggaaac gccgatgcta 
atcggagggc gagagagagt ccacttagct 
tcagaaatct tcgaatttgg tgtttgcttg 
gttttgcatt gcagagccat gccggattat 
taaacagtag actcatcatc gaaaatctta 
taggaattga attgcttgaa aatttatcat 
cctctgtgta gtagttctta ttgctgtaaa 
ggtttgctgt tacttgtttc tccttttagg 
gaataacttt gttgttgcta tcattcatgt 
ggtttagttc aagatcgctg cgagattggc 
tttttatttc cagcaatgtt acattgttat 
cctctttgat tcttctttct tgttgcagta 
ttacggttga gaaattggta cttcagaaat 
ctccagccat tactgtaatt tttattgaag 
tctgttttga aaatgctcaa ggttgtcatc 
gttttgtatc cagtttacgt caccctaagg 
tactgttttt aagtttagtt gtctgacccg 
gcttttttat caggtgtcac tttgatgctc 
tcttatgctc atactagcta tgacataaga 
tacgaaaaag aagcgtatgt attagtcact 
gttatgaact ttaggccaat cctgaagtct 
atttcatggt cgctcccaca ttgtgttatc 
atacttgcaa gagtttcttg tctaaacctc 
cacgttctgc atgtatacgg aagggttggg 
tcaccggatt catgggattt ataatagaac 
tttccttggt gaaaatcatc atccctgcgt 
acattttggc agtatataaa tcctattgtc 
cttctatatg ctattgaaag agtgttgaag 
tgcatgttct actgcttctt ccacctttgg 
tttgcaaatt cgaaaaaccg aaaaaggcta 
cttagagtcg gtgatgtaat ttcagttact 
atattggcag agcttctctg cttcggggat 
aaaagtgtgg gagatgtgag ctattttact 
cgttgttttt gggtcatcta actaaccaaa 
actggagaat gtggaatatg gtatggttct 
aaatagaaga agagagctaa ttaagatctt 
tggttcgaca tatatacttc ccgtgcttgc 
ataccgatat gcaattgtcg agatttgttt 
tgtttttcag acactcgcca ttatcattgc 
atacatactt tctacattgc cctgtctcta 
gctaatattc aaagcattgt ttttacttaa 
gctatgcatc gcagttcctt gtcgtctctt 
tcaggttaaa aaattactaa actgctgcag 



aaaagtagaa gttgatttct ttttgtcaaa 1140 
ccattgaact tgattataga attgataatg 1200 
tacatttttc aaatgctata tcagttcttc 1260 
tatagaataa atttattgaa taacatacca 1320 
aaaaaatgaa atgaagatga agacaaaaat 13 8 0 
atatgaatac acaataaacc atattgatat 1440 
cccaacaccg cttttcaatt aaaaatcttc 1500 
tgaatccttt ttcctttctt cttcttcttc 1560 
taaggaacca gacacgaatc ccattcccac 1620 
ctttccctct ccattagatt ctgtttcctc 1680 
tctctgacgc ctcttttctc ccgacgctgt 1740 
tggattctgc tggcgttact acggtgacgg 1800 
ataggcttcg tcgacggaaa tcgagatcgg 18 6 0 
ccgataataa ttctccttcg gatgatgttg 192 0 
attccgttgt taacgatgac gctcagggaa 1980 
gtggcgataa taacggtggt ggaagaggcg 2040 
cgtttacgta tcgaccgtcg gttccagctc 2100 
ccgacgcaat cttcaaacag gtttaaaatc 2160 
ttgttttata tggaattgag tttggtgatt 2220 
tcaacctctg tgtagtagtt cttattgctg 2280 
tgaaggtttg ctgttacttg tttctccttt 2340 
tgcattgcag agccatgccg gattattcaa 2400 
cagtagactc atcatcgaaa atcttatgaa 2460 
aattgaattg cttgaaaatt tatcagagac 2520 
agtatggttg gttgatcaga acggatttct 2580 
cgcttttcat gtgttggtaa aagaagatgt 2640 
acgtataatg atgagtttag tgatcaagtt 2700 
tatccctttc gatctttcct ttggctgcct 2760 
acatatcaga acctgtgagt aattactatt 2820 
acaagtttgt atcatgaaga acttacaagt 2880 
tttcttcata ttattatcac catgacagag 2 940 
tgatactgtt tttctggtct cagtttgtga 3 000 
gtgatcttga aaatggacag gtgtgattct 3 060 
ctcacttgca ttgtgtggct aaagttggtt 3120 
tccctagcca atgcagctga taaggtaaaa 318 0 
tgcactgtgt tactgtttta accaaacact 3240 
cctactacgt tagcttgaag agcttggcat 3 3 00 
aggtaactgc aaagtgcatc aaccattctt 3360 
ggatctttgc ttttccccag ccaagttatc 3420 
tggctcgtca atttgcaaaa ctggtcatat 3480 
aagtacgttt tcacatcttg ctttattagt 3540 
tgtcaccact tgacttcatg ttcttttgtt 3600 
aggaactcaa agcatccttt gaaaggcgat 3 660 
ctttcagttc caaatttata tgtgtggctc 3720 
tatgctgtga tcccatctct ttcaaaataa 3780 
aatctcatac gaatttgata tttttagttt 3840 
gaacgcaaat ctcttgtcca aaggttaaac 3 900 
cgtgaattct acaaagattg gtggaatgca 3 96 0 
caaaagaaaa cttatgattt ttaatgttgt 4020 
ttcatgtatt cactgtcttc ctttatcagt 4080 
cttcctaaac atcaccttct tttgtacaca 4140 
gttttccttg acagcctgtt cataaatgga 4200 
gcagcaagat accaaaggtg agtgagatat 42 6 0 
ctgtgatata aatttaaccc tccacacact 4320 
tttcctagtc tctgcagtct ttcatgaggt 4380 
gacgcatgaa cacacgctag tgaaagaaat 4440 
cgatcttgtg- ttacaaattt ccttttgaca 4500 
caagctatgg gcttttcttg ggattatgtt 456 0 
tcgattttta ctaaactcta atctcatatt 4620 
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ctgaccaacc aatttgtttg agtaggtgcc tttggtcttc atcacaaact atctacagga 4680 
aaggtttggc tcaacggtat gctctcaaaa cccgagaaaa tagaacgaat aactctttct 4740 
ttcatagcct agccatttaa atcgcaatgc tgaaacttaa taataaaggt gatctgtttt 4800 
ggaatgggat catattatta ggtggggaac atgatcttct ggttcatctt ctgcattttc 4860 
ggacaaccga tgtgtgtgct tctttattac cacgacctga tgaaccgaaa aggatcgatg 4920 
tcatgaaaca actgttcaaa aaatgacttt cttcaaacat ctatggcctc gttggatctc 4980 
cgttgatgtt gtggtggttc tgatgctaaa acgacaaata gtgttataac cattgaagaa 5040 
gaaaagaaaa ttagagttgt tgtatctgca aaaattttgg tagagacacg cgaacccgtt 5100 
tggattttgt tatggtgtaa agaaatttca atcaaaaaac tgttgtaata attgttacca 5160 
aaaagaaatg cttttctgga aacgagggga aaaatagtag ttttgttagg ttttactgtt 5220 
tggaccaaat ctagtaaaaa actttttgta ataaggaaaa aaaaagaaca aatgtgataa 5280
atgcatgggg attgtatgaa accttccaat aaagttgatt ggtggtcccg ttttgggga 5339



<210> 25 
<211> 547 
<212> PRT 

<213> Arabidopsis thaliana 



<400> 25 

Met Ala Ile Leu Asp Ser Ala Gly Val Thr Thr Val Thr Glu Asn Gly
1 5 10 15

Gly Gly Glu Phe Val Asp Leu Asp Arg Leu Arg Arg Arg Lys Ser Arg
20 25 30

Ser Asp Ser Ser Asn Gly Leu Leu Leu Ser Gly Ser Asp Asn Asn Ser
35 40 45

Pro Ser Asp Asp Val Gly Ala Pro Ala Asp Val Arg Asp Arg Ile Asp
50 55 60

Ser Val Val Asn Asp Asp Ala Gln Gly Thr Ala Asn Leu Ala Gly Asp
65 70 75 80

Asn Asn Gly Gly Gly Asp Asn Asn Gly Gly Gly Arg Gly Gly Gly Glu
85 90 95

Gly Arg Gly Asn Ala Asp Ala Thr Phe Thr Tyr Arg Pro Ser Val Pro
100 105 110

Ala His Arg Arg Ala Arg Glu Ser Pro Leu Ser Ser Asp Ala Ile Phe
115 120 125

Lys Gln Ser His Ala Gly Leu Phe Asn Leu Cys Val Val Val Leu Ile
130 135 140

Ala Val Asn Ser Arg Leu Ile Ile Glu Asn Leu Met Lys Ser His Ala
145 150 155 160

Gly Leu Phe Asn Leu Cys Val Val Val Leu Ile Ala Val Asn Ser Arg
165 170 175

Leu Ile Ile Glu Asn Leu Met Lys Tyr Gly Trp Leu Ile Arg Thr Asp
180 185 190

Phe Trp Phe Ser Ser Arg Ser Leu Arg Asp Trp Pro Leu Phe Met Cys
195 200 205

Cys Ile Ser Leu Ser Ile Phe Pro Leu Ala Ala Phe Thr Val Glu Lys
210 215 220

Leu Val Leu Gln Lys Tyr Ile Ser Glu Pro Val Val Ile Phe Leu His
225 230 235 240

Ile Ile Ile Thr Met Thr Glu Val Leu Tyr Pro Val Tyr Val Thr Leu
245 250 255

Arg Cys Asp Ser Ala Phe Leu Ser Gly Val Thr Leu Met Leu Leu Thr
260 265 270

Cys Ile Val Trp Leu Lys Leu Val Ser Tyr Ala His Thr Ser Tyr Asp
275 280 285

Ile Arg Ser Leu Ala Asn Ala Ala Asp Lys Ala Asn Pro Glu Val Ser
290 295 300

Tyr Tyr Val Ser Leu Lys Ser Leu Ala Tyr Phe Met Val Ala Pro Thr
305 310 315 320

Leu Cys Tyr Gln Pro Ser Tyr Pro Arg Ser Ala Cys Ile Arg Lys Gly
325 330 335

Trp Val Ala Arg Gln Phe Ala Lys Leu Val Ile Phe Thr Gly Phe Met
340 345 350

Gly Phe Ile Ile Glu Gln Tyr Ile Asn Pro Ile Val Arg Asn Ser Lys
355 360 365

His Pro Leu Lys Gly Asp Leu Leu Tyr Ala Ile Glu Arg Val Leu Lys
370 375 380

Leu Ser Val Pro Asn Leu Tyr Val Trp Leu Cys Met Phe Tyr Cys Phe
385 390 395 400

Phe His Leu Trp Leu Asn Ile Leu Ala Glu Leu Leu Cys Phe Gly Asp
405 410 415

Arg Glu Phe Tyr Lys Asp Trp Trp Asn Ala Lys Ser Val Gly Asp Tyr
420 425 430

Trp Arg Met Trp Asn Met Pro Val His Lys Trp Met Val Arg His Ile
435 440 445

Tyr Phe Pro Cys Leu Arg Ser Lys Ile Pro Lys Thr Leu Ala Ile Ile
450 455 460

Ile Ala Phe Leu Val Ser Ala Val Phe His Glu Leu Cys Ile Ala Val
465 470 475 480

Pro Cys Arg Leu Phe Lys Leu Trp Ala Phe Leu Gly Ile Met Phe Gln
485 490 495

Val Pro Leu Val Phe Ile Thr Asn Tyr Leu Gln Glu Arg Phe Gly Ser
500 505 510

Thr Val Gly Asn Met Ile Phe Trp Phe Ile Phe Cys Ile Phe Gly Gln
515 520 525

Pro Met Cys Val Leu Leu Tyr Tyr His Asp Leu Met Asn Arg Lys Gly
530 535 540 



Ser Met Ser 
545

SEQUENCE LISTING



<110> National Research Council of Canada 
Zou, Jitao 
Taylor, David C 
Wei, Yangdou 
Jako, Colette C 

<120> Diacylglycerol Acyl Transferase Gene from Plants

<130>

<140> PCT/CA

<150> 60/112,812

<151> 1998-12-17

<160> 25 

<170> PatentIn Ver. 2.1

<210> 1 
<211> 1904 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 1 

atttcttagc ttcttccttc aatccgctct ttccctctcc attagattct gtttcctctt 60 
tcaatttctt ctgcatgctt ctcgattctc tctgacgcct cttttctccc gacgctgttt 120 
cgtcaaacgc ttttcgaaat ggcgattttg gattctgctg gcgttactac ggtgacggag 180 
aacggtggcg gagagttcgt cgatcttgat aggcttcgtc gacggaaatc gagatcggat 240
tcttctaacg gacttcttct ctctggttcc gataataatt ctccttcgga tgatgttgga 300
gctcccgccg acgttaggga tcggattgat tccgttgtta acgatgacgc tcagggaaca 360
gccaatttgg ccggagataa taacggtggt ggcgataata acggtggtgg aagaggcggc 420
ggagaaggaa gaggaaacgc cgatgctacg tttacgtatc gaccgtcggt tccagctcat 480 
cggagggcga gagagagtcc acttagctcc gacgcaatct tcaaacagag ccatgccgga 540 
ttattcaacc tctgtgtagt agttcttatt gctgtaaaca gtagactcat catcgaaaat 600 
cttatgaagt atggttggtt gatcagaacg gatttctggt ttagttcaag atcgctgcga 660 
gattggccgc ttttcatgtg ttgtatatcc ctttcgatct ttcctttggc tgcctttacg 720 
gttgagaaat tggtacttca gaaatacata tcagaacctg ttgtcatctt tcttcatatt 780 
attatcacca tgacagaggt tttgtatcca gtttacgtca ccctaaggtg tgattctgct 840
tttttatcag gtgtcacttt gatgctcctc acttgcattg tgtggctaaa gttggtttct 900
tatgctcata ctagctatga cataagatcc ctagccaatg cagctgataa ggccaatcct 960
gaagtctcct actacgttag cttgaagagc ttggcatatt tcatggtcgc tcccacattg 1020 
tgttatcagc caagttatcc acgttctgca tgtatacgga agggttgggt ggctcgtcaa 1080 
tttgcaaaac tggtcatatt caccggattc atgggattta taatagaaca atatataaat 1140 
cctattgtca ggaactcaaa gcatcctttg aaaggcgatc ttctatatgc tattgaaaga 1200 
gtgttgaagc tttcagttcc aaatttatat gtgtggctct gcatgttcta ctgcttcttc 1260 
cacctttggt taaacatatt ggcagagctt ctctgcttcg gggatcgtga attctacaaa 1320 
gattggtgga atgcaaaaag tgtgggagat tactggagaa tgtggaatat gcctgttcat 1380 
aaatggatgg ttcgacatat atacttcccg tgcttgcgca gcaagatacc aaagacactc 1440
gccattatca ttgctttcct agtctctgca gtctttcatg agctatgcat cgcagttcct 1500 
tgtcgtctct tcaagctatg ggcttttctt gggattatgt ttcaggtgcc tttggtcttc 1560 
atcacaaact atctacagga aaggtttggc tcaacggtgg ggaacatgat cttctggttc 1620 
atcttctgca ttttcggaca accgatgtgt gtgcttcttt attaccacga cctgatgaac 1680 
cgaaaaggat cgatgtcatg aaacaactgt tcaaaaaatg actttcttca aacatctatg 1740
gcctcgttgg atctccgttg atgttgtggt ggttctgatg ctaaaacgac aaatagtgtt 1800
ataaccattg aagaagaaaa gaaaattaga gttgttgtat ctgcaaaaat tttggtagag 1860
acacgcaaac ccgtttggat tttgttatgg tgtaaagcgg ccgc 1904



<210> 2 
<211> 520 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 2 

Met Ala Ile Leu Asp Ser Ala Gly Val Thr Thr Val Thr Glu Asn Gly
1 5 10 15

Gly Gly Glu Phe Val Asp Leu Asp Arg Leu Arg Arg Arg Lys Ser Arg
20 25 30

Ser Asp Ser Ser Asn Gly Leu Leu Leu Ser Gly Ser Asp Asn Asn Ser
35 40 45

Pro Ser Asp Asp Val Gly Ala Pro Ala Asp Val Arg Asp Arg Ile Asp
50 55 60

Ser Val Val Asn Asp Asp Ala Gln Gly Thr Ala Asn Leu Ala Gly Asp
65 70 75 80

Asn Asn Gly Gly Gly Asp Asn Asn Gly Gly Gly Arg Gly Gly Gly Glu
85 90 95

Gly Arg Gly Asn Ala Asp Ala Thr Phe Thr Tyr Arg Pro Ser Val Pro
100 105 110

Ala His Arg Arg Ala Arg Glu Ser Pro Leu Ser Ser Asp Ala Ile Phe
115 120 125

Lys Gln Ser His Ala Gly Leu Phe Asn Leu Cys Val Val Val Leu Ile
130 135 140

Ala Val Asn Ser Arg Leu Ile Ile Glu Asn Leu Met Lys Tyr Gly Trp
145 150 155 160

Leu Ile Arg Thr Asp Phe Trp Phe Ser Ser Arg Ser Leu Arg Asp Trp
165 170 175

Pro Leu Phe Met Cys Cys Ile Ser Leu Ser Ile Phe Pro Leu Ala Ala
180 185 190

Phe Thr Val Glu Lys Leu Val Leu Gln Lys Tyr Ile Ser Glu Pro Val
195 200 205

Val Ile Phe Leu His Ile Ile Ile Thr Met Thr Glu Val Leu Tyr Pro
210 215 220

Val Tyr Val Thr Leu Arg Cys Asp Ser Ala Phe Leu Ser Gly Val Thr
225 230 235 240

Leu Met Leu Leu Thr Cys Ile Val Trp Leu Lys Leu Val Ser Tyr Ala
245 250 255

His Thr Ser Tyr Asp Ile Arg Ser Leu Ala Asn Ala Ala Asp Lys Ala
260 265 270

Asn Pro Glu Val Ser Tyr Tyr Val Ser Leu Lys Ser Leu Ala Tyr Phe
275 280 285

Met Val Ala Pro Thr Leu Cys Tyr Gln Pro Ser Tyr Pro Arg Ser Ala
290 295 300

Cys Ile Arg Lys Gly Trp Val Ala Arg Gln Phe Ala Lys Leu Val Ile
305 310 315 320

Phe Thr Gly Phe Met Gly Phe Ile Ile Glu Gln Tyr Ile Asn Pro Ile
325 330 335

Val Arg Asn Ser Lys His Pro Leu Lys Gly Asp Leu Leu Tyr Ala Ile
340 345 350

Glu Arg Val Leu Lys Leu Ser Val Pro Asn Leu Tyr Val Trp Leu Cys
355 360 365

Met Phe Tyr Cys Phe Phe His Leu Trp Leu Asn Ile Leu Ala Glu Leu
370 375 380

Leu Cys Phe Gly Asp Arg Glu Phe Tyr Lys Asp Trp Trp Asn Ala Lys
385 390 395 400

Ser Val Gly Asp Tyr Trp Arg Met Trp Asn Met Pro Val His Lys Trp
405 410 415

Met Val Arg His Ile Tyr Phe Pro Cys Leu Arg Ser Lys Ile Pro Lys
420 425 430

Thr Leu Ala Ile Ile Ile Ala Phe Leu Val Ser Ala Val Phe His Glu
435 440 445

Leu Cys Ile Ala Val Pro Cys Arg Leu Phe Lys Leu Trp Ala Phe Leu
450 455 460

Gly Ile Met Phe Gln Val Pro Leu Val Phe Ile Thr Asn Tyr Leu Gln
465 470 475 480

Glu Arg Phe Gly Ser Thr Val Gly Asn Met Ile Phe Trp Phe Ile Phe
485 490 495

Cys Ile Phe Gly Gln Pro Met Cys Val Leu Leu Tyr Tyr His Asp Leu
500 505 510

Met Asn Arg Lys Gly Ser Met Ser
515 520



<210> 3 
<211> 5193 



<212> DNA 

<213> Arabidopsis thaliana 



<400> 3 

gctcacgacc cattcttccc gttccatttg gttttattta tttcaaagtt taatattcct 60
tttgtataac attcaaatct tcacatgatt gattgtgtga aaaccccaca gattttacta 120
caataggggg agttgactta aaatagctat tgatgtcgaa aaaatgtatt ttagttataa 180
attatactaa agaaaatttt tgatttgtct gttgtttaag catatgtatt gttaaactta 240
aaaaaatatg tattgttaat cttaaaaatg taggagtaca catcaaatac tcgagcataa 300
tcaaaaccgt attcatagac cgatgtgaga atcaaataga agataatgtg attttttaaa 360
atatcgtatc tccaaatcaa tcacttagaa gataatgtaa ttctttatgt gctacataaa 420
taaatatata tatatatata tatatatatc ttgtatatat gtcttgacaa aaaattgcca 480
gtcaaaaacc atgactgaat caaactataa gtcggattga atcaaactat aagtcggatg 540
agtattaatt tccattatgt ttctatactt tacaaaccgg aaaatagata ttatagatac 600
caaaaaagta gatttgtgta tattattaga agatttggaa tttcatcatt atcaggatct 660
aaagtacttc cctaattaaa tcatgtcggt tgaaaaagct caatgaatgt ttgaaatttg 720
gaaagtttat taaattcgga tctttttttt ttgtttgtcg tcccaaacat ttttatttta 780
ttacaaataa tcaacttatc cttactacta aatcatttca tatctttgat accaacaaat 840
catttcatat tctattttga tgtttaagaa aacactattt accagttaca aaatattata 900
aggattgttg tttagaaaaa aaagtacaag ttgaattctt tttgtcaaat ataaaattga 960
ctttttaata tataattgac ttattgaaca tgattacaga attaatcatc tacaaaactt 1020
tccaagttta taataaatac atttcaaaga ctattagttc ttcttaaaat atttctaaaa 1080
gtgatcaaag actaccacat ataattcaga aaaagtagaa gttgatttct ttttgtcaaa 1140
taaataattg acttaaaata gtttggaaag ccattgaact tgattataga attgataatg 1200
tacataaaaa aattccaagt ttataataaa tacatttttc aaatgctata tcagttcttc 1260
ttaaaatatt tcactaaaaa aacactcaaa tatagaataa atttattgaa taacatacca 1320
actgtaaaac agaatttgac aaaaaaaaaa aaaaaatgaa atgaagatga agacaaaaat 1380
aaatcaccag aggatcttat gcaaaaaaat atatgaatac acaataaacc atattgatat 1440
ttttaaaata aaataaaaac agaaaaatat cccaacaccg cttttcaatt aaaaatcttc 1500
cgtcaccatt gttgtcatct tcctctctcg tgaatccttt ttcctttctt cttcttcttc 1560
tcttcagaga aaactttgct tctctttcta taaggaacca gacacgaatc ccattcccac 1620
cgatttctta gcttcttcct tcaatccgct ctttccctct ccattagatt ctgtttcctc 1680
tttcaatttc ttctgcatgc ttctcgattc tctctgacgc ctcttttctc ccgacgctgt 1740
ttcgtcaaac gcttttcgaa atggcgattt tggattctgc tggcgttact acggtgacgg 1800
agaacggtgg cggagagttc gtcgatcttg ataggcttcg tcgacggaaa tcgagatcgg 1860
attcttctaa cggacttctt ctctctggtt ccgataataa ttctccttcg gatgatgttg 1920
gagctcccgc cgacgttagg gatcggattg attccgttgt taacgatgac gctcagggaa 1980
cagccaattt ggccggagat aataacggtg gtggcgataa taacggtggt ggaagaggcg 2040
gcggagaagg aagaggaaac gccgatgcta cgtttacgta tcgaccgtcg gttccagctc 2100
atcggagggc gagagagagt ccacttagct ccgacgcaat cttcaaacag gtttaaaatc 2160
tcagaaatct tcgaatttgg tgtttgcttg ttgttttata tggaattgag tttggtgatt 2220
gttttgcatt gcagagccat gccggattat tcaacctctg tgtagtagtt cttattgctg 2280
taaacagtag actcatcatc gaaaatctta tgaaggtttg ctgttacttg tttctccttt 2340
taggaattga attgcttgaa aatttatcag agacgaataa ctttgttgtt gctatcattc 2400
atgtagtatg gttggttgat cagaacggat ttctggttta gttcaagatc gctgcgagat 2460
tggccgcttt tcatgtgttg gtaaaagaag atgtttttta tttccagcaa tgttacattg 2520
ttatacgtat aatgatgagt ttagtgatca agttcctctt tgattcttct ttcttgttgc 2580
agtatatccc tttcgatctt tcctttggct gcctttacgg ttgagaaatt ggtacttcag 2640
aaatacatat cagaacctgt gagtaattac tattctccag ccattactgt aatttttatt 2700
gaagacaagt ttgtatcatg aagaacttac aagttctgtt ttgaaaatgc tcaaggttgt 2760
catctttctt catattatta tcaccatgac agaggttttg tatccagttt acgtcaccct 2820
aaggtgatac tgtttttctg gtctcagttt gtgatactgt ttttaagttt agttgtctga 2880
cccggtgatc ttgaaaatgg acaggtgtga ttctgctttt ttatcaggtg tcactttgat 2940
gctcctcact tgcattgtgt ggctaaagtt ggtttcttat gctcatacta gctatgacat 3000
aagatcccta gccaatgcag ctgataaggt aaaatacgaa aaagaagcgt atgtattagt 3060
cacttgcact gtgttactgt tttaaccaaa cactgttatg aactttaggc caatcctgaa 3120
gtctcctact acgttagctt gaagagcttg gcatatttca tggtcgctcc cacattgtgt 3180
tatcaggtaa ctgcaaagtg catcaaccat tcttatactt gcaagagttt cttgtctaaa 3240
cctcggatct ttgcttttcc ccagccaagt tatccacgtt ctgcatgtat acggaagggt 3300
tgggtggctc gtcaatttgc aaaactggtc atattcaccg gattcatggg atttataata 3360 
gaacaagtac gttttcacat cttgctttat tagttttcct tggtgaaaat catcatccct 3420 
gcgttgtcac cacttgactt catgttcttt tgttacattt tggcagtata taaatcctat 3480
tgtcaggaac tcaaagcatc ctttgaaagg cgatcttcta tatgctattg aaagagtgtt 3540 
gaagctttca gttccaaatt tatatgtgtg gctctgcatg ttctactgct tcttccacct 3600 
ttggtatgct gtgatcccat ctctttcaaa ataatttgca aattcgaaaa accgaaaaag 3660 
gctaaatctc atacgaattt gatattttta gtttcttaga gtcggtgatg taatttcagt 3720
tactgaacgc aaatctcttg tccaaaggtt aaacatattg gcagagcttc tctgcttcgg 3780 
ggatcgtgaa ttctacaaag attggtggaa tgcaaaaagt gtgggagatg tgagctattt 3840 
tactcaaaag aaaacttatg atttttaatg ttgtcgttgt ttttgggtca tctaactaac 3900 
caaattcatg tattcactgt cttcctttat cagtactgga gaatgtggaa tatggtatgg 3960 
ttctcttcct aaacatcacc ttcttttgta cacaaaatag aagaagagag ctaattaaga 4020
tcttgttttc cttgacagcc tgttcataaa tggatggttc gacatatata cttcccgtgc 4080 
ttgcgcagca agataccaaa ggtgagtgag atatataccg atatgcaatt gtcgagattt 4140
gtttctgtga tataaattta accctccaca cacttgtttt tcagacactc gccattatca 4200 
ttgctttcct agtctctgca gtctttcatg aggtatacat actttctaca ttgccctgtc 4260 
tctagacgca tgaacacacg ctagtgaaag aaatgctaat attcaaagca ttgtttttac 4320
ttaacgatct tgtgttacaa atttcctttt gacagctatg catcgcagtt ccttgtcgtc 4380
tcttcaagct atgggctttt cttgggatta tgtttcaggt taaaaaatta ctaaactgct 4440
gcagtcgatt tttactaaac tctaatctca tattctgacc aaccaatttg tttgagtagg 4500 
tgcctttggt cttcatcaca aactatctac aggaaaggtt tggctcaacg gtatgctctc 4560
aaaacccgag aaaatagaac gaataactct ttctttcata gcctagccat ttaaatcgca 4620
atgctgaaac ttaataataa aggtgatctg ttttggaatg ggatcatatt attaggtggg 4680 
gaacatgatc ttctggttca tcttctgcat tttcggacaa ccgatgtgtg tgcttcttta 4740 
ttaccacgac ctgatgaacc gaaaaggatc gatgtcatga aacaactgtt caaaaaatga 4800
ctttcttcaa acatctatgg cctcgttgga tctccgttga tgttgtggtg gttctgatgc 4860 
taaaacgaca aatagtgtta taaccattga agaagaaaag aaaattagag ttgttgtatc 4920
tgcaaaaatt ttggtagaga cacgcgaacc cgtttggatt ttgttatggt gtaaagaaat 4980 
ttcaatcaaa aaactgttgt aataattgtt accaaaaaga aatgcttttc tggaaacgag 5040 
gggaaaaata gtagttttgt taggttttac tgtttggacc aaatctagta aaaaactttt 5100 
tgtaataagg aaaaaaaaag aacaaatgtg ataaatgcat ggggattgta tgaaaccttc 5160 
caataaagtt gattggtggt cccgttttgg gga 5193 



<210> 4 
<211> 498 
<212> PRT 
<213> mouse 

<400> 4 

Met Gly Asp Arg Gly Gly Ala Gly Ser Ser Arg Arg Arg Arg Thr Gly
1 5 10 15

Ser Arg Val Ser Val Gln Gly Gly Ser Gly Pro Lys Val Glu Glu Asp
20 25 30

Glu Val Arg Asp Ala Ala Val Ser Pro Asp Leu Gly Ala Gly Gly Asp
35 40 45

Ala Pro Ala Pro Ala Pro Ala Pro Ala His Thr Arg Asp Lys Asp Gly
50 55 60

Arg Thr Ser Val Gly Asp Gly Tyr Trp Asp Leu Arg Cys His Arg Leu
65 70 75 80

Gln Asp Ser Leu Phe Ser Ser Asp Ser Gly Phe Ser Asn Tyr Arg Gly
85 90 95

Ile Leu Asn Trp Cys Val Val Met Leu Ile Leu Ser Asn Ala Arg Leu
100 105 110

Phe Leu Glu Asn Leu Ile Lys Tyr Gly Ile Leu Val Asp Pro Ile Gln
115 120 125

Val Val Ser Leu Phe Leu Lys Asp Pro Tyr Ser Trp Pro Ala Pro Cys
130 135 140

Val Ile Ile Ala Ser Asn Ile Phe Val Val Ala Ala Phe Gln Ile Glu
145 150 155 160

Lys Arg Leu Ala Val Gly Ala Leu Thr Glu Gln Met Gly Leu Leu Leu
165 170 175

His Val Val Asn Leu Ala Thr Ile Ile Cys Phe Pro Ala Ala Val Ala
180 185 190

Leu Leu Val Glu Ser Ile Thr Pro Val Gly Ser Val Phe Ala Leu Ala
195 200 205

Ser Tyr Ser Ile Met Phe Leu Lys Leu Tyr Ser Tyr Arg Asp Val Asn
210 215 220

Leu Trp Cys Arg Gln Arg Arg Val Lys Ala Lys Ala Val Ser Thr Gly
225 230 235 240

Lys Lys Val Ser Gly Ala Ala Ala Gln Gln Ala Val Ser Tyr Pro Asp
245 250 255



Asn Leu Thr Tyr Arg Asp Leu Tyr Tyr Phe Ile Phe Ala Pro Thr Leu
260 265 270

Cys Tyr Glu Leu Asn Phe Pro Arg Ser Pro Arg Ile Arg Lys Arg Phe
275 280 285

Leu Leu Arg Arg Val Leu Glu Met Leu Phe Phe Thr Gln Leu Gln Val
290 295 300

Gly Leu Ile Gln Gln Trp Met Val Pro Thr Ile Gln Asn Ser Met Lys
305 310 315 320

Pro Phe Lys Asp Met Asp Tyr Ser Arg Ile Ile Glu Arg Leu Leu Lys
325 330 335

Leu Ala Val Pro Asn His Leu Ile Trp Leu Ile Phe Phe Tyr Trp Phe
340 345 350

Phe His Ser Cys Leu Asn Ala Val Ala Glu Leu Leu Gln Phe Gly Asp
355 360 365



Arg Glu Phe Tyr Arg Asp Trp Trp Asn Ala Glu Ser Val Thr Tyr Phe
370 375 380

Trp Gln Asn Trp Asn Ile Pro Val His Lys Trp Cys Ile Arg His Phe
385 390 395 400

Tyr Lys Pro Met Leu Arg His Gly Ser Ser Lys Trp Val Ala Arg Thr
405 410 415

Gly Val Phe Leu Thr Ser Ala Phe Phe His Glu Tyr Leu Val Ser Val
420 425 430

Pro Leu Arg Met Phe Arg Leu Trp Ala Phe Thr Ala Met Met Ala Gln
435 440 445

Val Pro Leu Ala Trp Ile Val Gly Arg Phe Phe Gln Gly Asn Tyr Gly
450 455 460

Asn Ala Ala Val Trp Val Thr Leu Ile Ile Gly Gln Pro Val Ala Val
465 470 475 480

Leu Met Tyr Val His Asp Tyr Tyr Val Leu Asn Tyr Asp Ala Pro Val
485 490 495

Gly Val


<210> 5 
<211> 488 
<212> PRT 
<213> human 

<400> 5 

Met Gly Asp Arg Gly Ser Ser Arg Arg Arg Arg Thr Gly Ser Arg Pro
1 5 10 15

Ser Ser His Gly Gly Gly Gly Pro Ala Ala Ala Glu Glu Glu Val Arg
20 25 30

Asp Ala Ala Ala Gly Pro Asp Val Gly Ala Ala Gly Asp Ala Pro Ala
35 40 45

Pro Ala Pro Asn Lys Asp Gly Asp Ala Gly Val Gly Ser Gly His Trp
50 55 60

Glu Leu Arg Cys His Arg Leu Gln Asp Ser Leu Phe Ser Ser Asp Ser
65 70 75 80

Gly Phe Ser Asn Tyr Arg Gly Ile Leu Asn Trp Cys Val Val Met Leu
85 90 95

Ile Leu Ser Asn Ala Arg Leu Phe Leu Glu Asn Leu Ile Lys Tyr Gly
100 105 110

Ile Leu Val Asp Pro Ile Gln Val Val Ser Leu Phe Leu Lys Asp Pro
115 120 125

His Ser Trp Pro Ala Pro Cys Leu Val Ile Ala Ala Asn Val Phe Ala
130 135 140

Val Ala Ala Phe Gln Val Glu Lys Arg Leu Ala Val Gly Ala Leu Thr
145 150 155 160

Glu Gln Ala Gly Leu Leu Leu His Val Ala Asn Leu Ala Thr Ile Leu
165 170 175

Cys Phe Pro Ala Ala Val Val Leu Leu Val Glu Ser Ile Thr Pro Val
180 185 190

Gly Ser Leu Leu Ala Leu Met Ala His Thr Ile Leu Phe Leu Lys Leu
195 200 205

Phe Ser Tyr Arg Asp Val Asn Ser Trp Cys Arg Arg Ala Arg Ala Lys
210 215 220

Ala Ala Ser Ala Gly Lys Lys Ala Ser Ser Ala Ala Ala Pro His Thr
225 230 235 240

Val Ser Tyr Pro Asp Asn Leu Thr Tyr Arg Asp Leu Tyr Tyr Phe Leu
245 250 255

Phe Ala Pro Thr Leu Cys Tyr Glu Leu Asn Phe Pro Arg Ser Pro Arg
260 265 270

Ile Arg Lys Arg Phe Leu Leu Arg Arg Ile Leu Glu Met Leu Phe Phe
275 280 285

Thr Gln Leu Gln Val Gly Leu Ile Gln Gln Trp Met Val Pro Thr Ile
290 295 300

Gln Asn Ser Met Lys Pro Phe Lys Asp Met Asp Tyr Ser Arg Ile Ile
305 310 315 320

Glu Arg Leu Leu Lys Leu Ala Val Pro Asn His Leu Ile Trp Leu Ile
325 330 335

Phe Phe Tyr Trp Leu Phe His Ser Cys Leu Asn Ala Val Ala Glu Leu
340 345 350

Met Gln Phe Gly Asp Arg Glu Phe Tyr Arg Asp Trp Trp Asn Ser Glu
355 360 365

Ser Val Thr Tyr Phe Trp Gln Asn Trp Asn Ile Pro Val His Lys Trp
370 375 380

Cys Ile Arg His Phe Tyr Lys Pro Met Leu Arg Arg Gly Ser Ser Lys
385 390 395 400

Trp Met Ala Arg Thr Gly Val Phe Leu Ala Ser Ala Phe Phe His Glu
405 410 415

Tyr Leu Val Ser Val Pro Leu Arg Met Phe Arg Leu Trp Ala Phe Thr
420 425 430

Gly Met Met Ala Gln Ile Pro Leu Ala Trp Phe Val Gly Arg Phe Phe
435 440 445

Gln Gly Asn Tyr Gly Asn Ala Ala Val Trp Leu Ser Leu Ile Ile Gly
450 455 460

Gln Pro Ile Ala Val Leu Met Tyr Val His Asp Tyr Tyr Val Leu Asn
465 470 475 480

Tyr Glu Ala Pro Ala Ala Glu Ala
485



<210> 6 
<211> 629 
<212> DNA 

<213> Arabidopsis thaliana 



<400> 6 

tgcatgtata cggaagggtt gggtggctcg tcaatttgca aaactggtca tattcaccgg 60
attcatggga tttataatag aacaatatat aaatcctatt gtcaggaact caaagcatcc 120
tttgaaaggc gatcttctat atgctattga aagagtgttg aagctttcag ttccaaattt 180
atatgtgtgg ctctgcatgt tctactgctt cttccacctt tggttaaaca tattggcaga 240
gcttctctgc ttcggggatc gtgaattcta caaagattgg tggaatgcaa aaagtgtggg 300
agattactgg gagaatgtgg aatatgcctg tccataaatg ggatgggtcc gacatatata 360
ccttccccgt gcttgcgcac aaggattacc caaagacacc ccggccatta accattggct 420
ttcccaagcc ccctggaggc ctttccatgg gccanggacc cggngtnccc tggcnggccc 480
ttcaaagcaa agggggnttn cctggggnta aagntccang ggcccttggg gcccanccaa 540
aannttcccc cgggaaaggg ttgcccaccg gggggngaaa aanncccggg ggcaccncgg 600
aattttggga acccgggggg ggccttttt 629



<210> 7 
<211> 11
<212> PRT

<213> Arabidopsis thaliana

<400> 7

His Lys Trp Met Val Arg His Ile Tyr Phe Pro
1 5 10



<210> 8 
<211> 341 
<212> PRT 

<213> Brassica napus 
<400> 8 

Met Cys Cys Leu Ser Leu Ser Ile Phe Pro Leu Ala Ala Phe Thr Val
1 5 10 15

Glu Lys Leu Val Leu Gln Lys Cys Ile Ser Glu Pro Val Val Ile Phe
20 25 30



Leu His Val Ile Ile Thr Met Thr Glu Val Leu Tyr Pro Val Tyr Val
35 40 45

Thr Leu Arg Cys Asp Ser Ala Phe Leu Ser Gly Asp Thr Leu Met Leu
50 55 60

Leu Thr Cys Ile Val Trp Leu Lys Leu Val Ser Tyr Ala His Thr Asn
65 70 75 80

Tyr Asp Ile Arg Thr Leu Ala Asn Ser Ser Asp Lys Ala Asn Pro Glu
85 90 95

Val Ser Tyr Tyr Val Ser Leu Lys Ser Leu Ala Tyr Phe Met Leu Ala
100 105 110

Pro Thr Leu Cys Tyr Gln Pro Ser Tyr Pro Arg Ser Pro Cys Ile Arg
115 120 125

Lys Gly Trp Val Ala Arg Gln Phe Ala Lys Leu Val Ile Phe Thr Gly
130 135 140

Leu Met Gly Phe Ile Ile Glu Gln Tyr Ile Asn Pro Ile Val Arg Asn
145 150 155 160

Ser Lys His Pro Leu Lys Gly Asp Leu Leu Tyr Ala Ile Glu Arg Val
165 170 175

Leu Lys Leu Ser Val Pro Asn Leu Tyr Val Trp Leu Cys Met Phe Tyr
180 185 190

Cys Phe Phe His Leu Trp Leu Asn Ile Leu Ala Glu Leu Leu Cys Phe
195 200 205

Gly Asp Arg Glu Phe Tyr Lys Asp Trp Trp Asn Ala Lys Ser Val Gly
210 215 220

Asp Tyr Trp Arg Met Trp Asn Met Pro Val His Lys Trp Met Val Arg
225 230 235 240

His Val Tyr Phe Pro Cys Leu Arg Ile Lys Ile Pro Lys Val Pro Ala
245 250 255

Ile Ile Ile Ala Phe Leu Val Ser Ala Val Phe His Glu Leu Cys Ile
260 265 270

Ala Val Pro Cys Arg Leu Phe Asn Leu Trp Ala Phe Met Gly Ile Met
275 280 285

Phe Gln Val Pro Leu Val Phe Ile Thr Asn Phe Leu Gln Glu Arg Phe
290 295 300

Gly Ser Met Val Gly Asn Met Ile Phe Gly Ser Ala Ser Cys Ile Phe
305 310 315 320

Gly Gln Pro Met Cys Gly Leu Leu Tyr Tyr His Asp Leu Met Asn Arg
325 330 335

Lys Gly Ser Met Ser
340

<210> 9
<211> 503 
<212> PRT 

<213> Brassica napus 
<400> 9 

Met Ala Ile Leu Asp Ser Gly Gly Val Ala Val Pro Pro Thr Glu Asn
1 5 10 15

Gly Val Ala Asp Leu Asp Arg Leu His Arg Arg Lys Ser Ser Ser Asp
20 25 30

Ser Ser Asn Gly Leu Leu Ser Asp Thr Ser Pro Ser Asp Asp Val Gly
35 40 45

Ala Ala Ala Ala Glu Arg Asp Arg Val Asp Ser Ala Ala Glu Glu Glu
50 55 60

Ala Gln Gly Thr Ala Asn Leu Ala Gly Gly Asp Ala Glu Thr Arg Glu
65 70 75 80

Ser Ala Gly Gly Asp Val Arg Phe Thr Tyr Arg Pro Ser Val Pro Ala
85 90 95

His Arg Arg Thr Arg Glu Ser Pro Leu Ser Ser Asp Ala Ile Phe Lys
100 105 110

Gln Ser His Ala Gly Leu Phe Asn Leu Cys Val Val Val Leu Val Ala
115 120 125

Val Asn Ser Arg Leu Ile Ile Glu Asn Leu Met Lys Tyr Gly Trp Leu
130 135 140

Ile Arg Thr Asp Phe Trp Phe Ser Ser Thr Ser Leu Arg Asp Trp Pro
145 150 155 160

Leu Phe Met Cys Cys Leu Ser Leu Ser Val Phe Pro Leu Ala Ala Phe
165 170 175

Thr Val Glu Lys Met Val Leu Gln Lys Phe Ile Ser Glu Pro Val Ala
180 185 190

Ile Ile Leu His Val Ile Ile Thr Met Thr Glu Val Leu Tyr Pro Val
195 200 205

Tyr Val Thr Leu Arg Cys Asp Ser Ala Phe Leu Ser Gly Val Thr Leu
210 215 220

Met Leu Leu Thr Cys Ile Val Trp Leu Lys Leu Val Ser Tyr Ala His
225 230 235 240



Thr Ser Tyr Asp Ile Arg Thr Leu Ala Asn Ser Ala Asp Lys Val Asp
245 250 255

Pro Glu Ile Ser Tyr Tyr Val Ser Leu Lys Ser Leu Ala Tyr Phe Met
260 265 270

Val Ala Pro Thr Leu Cys Tyr Gln Pro Ser Tyr Pro Arg Ser Pro Cys
275 280 285

Ile Arg Lys Gly Trp Val Ala Arg Gln Leu Ala Lys Leu Val Ile Phe
290 295 300

Thr Gly Leu Met Gly Phe Ile Ile Glu Gln Tyr Ile Asn Pro Ile Val
305 310 315 320

Arg Asn Ser Lys His Pro Leu Lys Gly Asp Leu Leu Tyr Ala Ile Glu
325 330 335

Arg Val Leu Lys Leu Ser Val Pro Asn Leu Tyr Val Trp Leu Cys Met
340 345 350

Phe Tyr Cys Phe Phe His Leu Trp Leu Asn Ile Leu Ala Glu Leu Leu
355 360 365

Cys Phe Gly Asp Arg Glu Phe Tyr Lys Asp Trp Trp Asn Ala Lys Ser
370 375 380

Val Gly Asp Tyr Trp Arg Met Trp Asn Met Pro Val His Lys Trp Met
385 390 395 400

Val Arg His Val Tyr Phe Pro Cys Leu Arg Ile Lys Ile Pro Lys Val
405 410 415

Pro Ala Ile Ile Ile Ala Phe Leu Val Ser Ala Val Phe His Glu Leu
420 425 430

Cys Ile Ala Val Pro Cys Arg Leu Phe Asn Leu Trp Ala Phe Met Gly
435 440 445

Ile Met Phe Gln Val Pro Leu Val Phe Ile Thr Asn Phe Leu Gln Glu
450 455 460

Arg Phe Gly Ser Met Val Gly Asn Met Ile Phe Gly Ser Ala Ser Cys
465 470 475 480

Ile Phe Gly Gln Pro Met Cys Gly Leu Leu Tyr Tyr His Asp Leu Met
485 490 495

Asn Arg Lys Gly Ser Met Ser
500



<210> 10
<211> 11
<212> PRT

<213> Arabidopsis thaliana

<220>
<221> VARIANT
<222> (1)

<220>
<221> VARIANT
<222> (3)

<220>
<221> VARIANT
<222> (5)..(6)

<220>
<221> VARIANT
<222> (8)..(10)

<400> 10

Xaa Leu Xaa Lys Xaa Xaa Ser Xaa Xaa Xaa Val
1 5 10



<210> 11 
<211> 27 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 11 

Ser His Ala Gly Leu Phe Asn Leu Cys Val Val Val Leu Ile Ala Val
1 5 10 15

Asn Ser Arg Leu Ile Ile Glu Asn Leu Met Lys
20 25



<210> 12 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer of 
DGATXbaI

<400> 12 

ctagtctaga atggcgattt tgga 



<210> 13 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer of 
DGATXhoI



<400> 13

gcgctcgagt ttcatgacat cga



<210> 14 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer DGAT1
<400> 14 

agacacgaat cccattccca ccga 



<210> 15 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer DGAT2 
<400> 15 

agtggtgaca acgcagggat gatg 



<210> 16 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer DGAT3 
<400> 16 

atggtcgctc ccacattgtg t 



<210> 17 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer DGAT4 
<400> 17 

catacaatcc ccatgacatt tatca 



<210> 18 
<211> 24 
<212> DNA 

<213> Artificial Sequence 



<220>
<223> Description of Artificial Sequence: Primer A
<400> 18 

cgaccgtcgg ttccagctca tcgg 



<210> 19 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer B 
<400> 19 

gcggccaatc tcgcagcgat cttg 



<210> 20 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer C 
<400> 20 

taaacagtag actcatcatc g 



<210> 21 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer Gen 1 
<400> 21 

gagaggatcc acgctcacga cccattcttc ccg 



<210> 22 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer Gen 2 
<400> 22 

aagaaggatc catccccaaa acgggaccac caa 



<210> 23 
<211> 1985 
<212> DNA
<213> Arabidopsis thaliana
<400> 23

atttcttagc ttcttccttc aatccgctct ttccctctcc attagattct gtttcctctt 60
tcaatttctt ctgcatgctt ctcgattctc tctgacgcct cttttctccc gacgctgttt 120
cgtcaaacgc ttttcgaaat ggcgattttg gattctgctg gcgttactac ggtgacggag 180
aacggtggcg gagagttcgt cgatcttgat aggcttcgtc gacggaaatc gagatcggat 240
tcttctaacg gacttcttct ctctggttcc gataataatt ctccttcgga tgatgttgga 300
gctcccgccg acgttaggga tcggattgat tccgttgtta acgatgacgc tcagggaaca 360
gccaatttgg ccggagataa taacggtggt ggcgataata acggtggtgg aagaggcggc 420
ggagaaggaa gaggaaacgc cgatgctacg tttacgtatc gaccgtcggt tccagctcat 480
cggagggcga gagagagtcc acttagctcc gacgcaatct tcaaacagag ccatgccgga 540
ttattcaacc tctgtgtagt agttcttatt gctgtaaaca gtagactcat catcgaaaat 600
cttatgaaga gccatgccgg attattcaac ctctgtgtag tagttcttat tgctgtaaac 660
agtagactca tcatcgaaaa tcttatgaag tatggttggt tgatcagaac ggatttctgg 720
tttagttcaa gatcgctgcg agattggccg cttttcatgt gttgtatatc cctttcgatc 780
tttcctttgg ctgcctttac ggttgagaaa ttggtacttc agaaatacat atcagaacct 840
gttgtcatct ttcttcatat tattatcacc atgacagagg ttttgtatcc agtttacgtc 900
accctaaggt gtgattctgc ttttttatca ggtgtcactt tgatgctcct cacttgcatt 960
gtgtggctaa agttggtttc ttatgctcat actagctatg acataagatc cctagccaat 1020
gcagctgata aggccaatcc tgaagtctcc tactacgtta gcttgaagag cttggcatat 1080
ttcatggtcg ctcccacatt gtgttatcag ccaagttatc cacgttctgc atgtatacgg 1140
aagggttggg tggctcgtca atttgcaaaa ctggtcatat tcaccggatt catgggattt 1200
ataatagaac aatatataaa tcctattgtc aggaactcaa agcatccttt gaaaggcgat 1260
cttctatatg ctattgaaag agtgttgaag ctttcagttc caaatttata tgtgtggctc 1320
tgcatgttct actgcttctt ccacctttgg ttaaacatat tggcagagct tctctgcttc 1380
ggggatcgtg aattctacaa agattggtgg aatgcaaaaa gtgtgggaga ttactggaga 1440
atgtggaata tgcctgttca taaatggatg gttcgacata tatacttccc gtgcttgcgc 1500
agcaagatac caaagacact cgccattatc attgctttcc tagtctctgc agtctttcat 1560
gagctatgca tcgcagttcc ttgtcgtctc ttcaagctat gggcttttct tgggattatg 1620
tttcaggtgc ctttggtctt catcacaaac tatctacagg aaaggtttgg ctcaacggtg 1680
gggaacatga tcttctggtt catcttctgc attttcggac aaccgatgtg tgtgcttctt 1740
tattaccacg acctgatgaa ccgaaaagga tcgatgtcat gaaacaactg ttcaaaaaat 1800
gactttcttc aaacatctat ggcctcgttg gatctccgtt gatgttgtgg tggttctgat 1860
gctaaaacga caaatagtgt tataaccatt gaagaagaaa agaaaattag agttgttgta 1920
tctgcaaaaa ttttggtaga gacacgcaaa cccgtttgga ttttgttatg gtgtaaagcg 1980
gccgc 1985



<210> 24 
<211> 5339 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 24 

gctcacgacc cattcttccc gttccatttg gttttattta tttcaaagtt taatattcct 60
tttgtataac attcaaatct tcacatgatt gattgtgtga aaaccccaca gattttacta 120
caataggggg agttgactta aaatagctat tgatgtcgaa aaaatgtatt ttagttataa 180
attatactaa agaaaatttt tgatttgtct gttgtttaag catatgtatt gttaaactta 240
aaaaaatatg tattgttaat cttaaaaatg taggagtaca catcaaatac tcgagcataa 300
tcaaaaccgt attcatagac cgatgtgaga atcaaataga agataatgtg attttttaaa 360
atatcgtatc tccaaatcaa tcacttagaa gataatgtaa ttctttatgt gctacataaa 420
taaatatata tatatatata tatatatatc ttgtatatat gtcttgacaa aaaattgcca 480
gtcaaaaacc atgactgaat caaactataa gtcggattga atcaaactat aagtcggatg 540
agtattaatt tccattatgt ttctatactt tacaaaccgg aaaatagata ttatagatac 600
caaaaaagta gatttgtgta tattattaga agatttggaa tttcatcatt atcaggatct 660
aaagtacttc cctaattaaa tcatgtcggt tgaaaaagct caatgaatgt ttgaaatttg 720
gaaagtttat taaattcgga tctttttttt ttgtttgtcg tcccaaacat ttttatttta 780
ttacaaataa tcaacttatc cttactacta aatcatttca tatctttgat accaacaaat 840 
catttcatat tctattttga tgtttaagaa aacactattt accagttaca aaatattata 900 
aggattgttg tttagaaaaa aaagtacaag ttgaattctt tttgtcaaat ataaaattga 960 
ctttttaata tataattgac ttattgaaca tgattacaga attaatcatc tacaaaactt 1020
tccaagttta taataaatac atttcaaaga ctattagttc ttcttaaaat atttctaaaa 1080 
gtgatcaaag actaccacat ataattcaga aaaagtagaa gttgatttct ttttgtcaaa 1140 
taaataattg acttaaaata gtttggaaag ccattgaact tgattataga attgataatg 1200 
tacataaaaa aattccaagt ttataataaa tacatttttc aaatgctata tcagttcttc 1260 
ttaaaatatt tcactaaaaa aacactcaaa tatagaataa atttattgaa taacatacca 1320 
actgtaaaac agaatttgac aaaaaaaaaa aaaaaatgaa atgaagatga agacaaaaat 1380 
aaatcaccag aggatcttat gcaaaaaaat atatgaatac acaataaacc atattgatat 1440 
ttttaaaata aaataaaaac agaaaaatat cccaacaccg cttttcaatt aaaaatcttc 1500 
cgtcaccatt gttgtcatct tcctctctcg tgaatccttt ttcctttctt cttcttcttc 1560 
tcttcagaga aaactttgct tctctttcta taaggaacca gacacgaatc ccattcccac 1620 
cgatttctta gcttcttcct tcaatccgct ctttccctct ccattagatt ctgtttcctc 1680 
tttcaatttc ttctgcatgc ttctcgattc tctctgacgc ctcttttctc ccgacgctgt 1740 
ttcgtcaaac gcttttcgaa atggcgattt tggattctgc tggcgttact acggtgacgg 1800 
agaacggtgg cggagagttc gtcgatcttg ataggcttcg tcgacggaaa tcgagatcgg 1860 
attcttctaa cggacttctt ctctctggtt ccgataataa ttctccttcg gatgatgttg 1920 
gagctcccgc cgacgttagg gatcggattg attccgttgt taacgatgac gctcagggaa 1980 
cagccaattt ggccggagat aataacggtg gtggcgataa taacggtggt ggaagaggcg 2040 
gcggagaagg aagaggaaac gccgatgcta cgtttacgta tcgaccgtcg gttccagctc 2100 
atcggagggc gagagagagt ccacttagct ccgacgcaat cttcaaacag gtttaaaatc 2160 
tcagaaatct tcgaatttgg tgtttgcttg ttgttttata tggaattgag tttggtgatt 2220 
gttttgcatt gcagagccat gccggattat tcaacctctg tgtagtagtt cttattgctg 2280 
taaacagtag actcatcatc gaaaatctta tgaaggtttg ctgttacttg tttctccttt 2340 
taggaattga attgcttgaa aatttatcat tgcattgcag agccatgccg gattattcaa 2400 
cctctgtgta gtagttctta ttgctgtaaa cagtagactc atcatcgaaa atcttatgaa 2460 
ggtttgctgt tacttgtttc tccttttagg aattgaattg cttgaaaatt tatcagagac 2520 
gaataacttt gttgttgcta tcattcatgt agtatggttg gttgatcaga acggatttct 2580 
ggtttagttc aagatcgctg cgagattggc cgcttttcat gtgttggtaa aagaagatgt 2640 
tttttatttc cagcaatgtt acattgttat acgtataatg atgagtttag tgatcaagtt 2700 
cctctttgat tcttctttct tgttgcagta tatccctttc gatctttcct ttggctgcct 2760 
ttacggttga gaaattggta cttcagaaat acatatcaga acctgtgagt aattactatt 2820 
ctccagccat tactgtaatt tttattgaag acaagtttgt atcatgaaga acttacaagt 2880 
tctgttttga aaatgctcaa ggttgtcatc tttcttcata ttattatcac catgacagag 2940 
gttttgtatc cagtttacgt caccctaagg tgatactgtt tttctggtct cagtttgtga 3000 
tactgttttt aagtttagtt gtctgacccg gtgatcttga aaatggacag gtgtgattct 3060 
gcttttttat caggtgtcac tttgatgctc ctcacttgca ttgtgtggct aaagttggtt 3120 
tcttatgctc atactagcta tgacataaga tccctagcca atgcagctga taaggtaaaa 3180 
tacgaaaaag aagcgtatgt attagtcact tgcactgtgt tactgtttta accaaacact 3240 
gttatgaact ttaggccaat cctgaagtct cctactacgt tagcttgaag agcttggcat 3300 
atttcatggt cgctcccaca ttgtgttatc aggtaactgc aaagtgcatc aaccattctt 3360 
atacttgcaa gagtttcttg tctaaacctc ggatctttgc ttttccccag ccaagttatc 3420 
cacgttctgc atgtatacgg aagggttggg tggctcgtca atttgcaaaa ctggtcatat 3480 
tcaccggatt catgggattt ataatagaac aagtacgttt tcacatcttg ctttattagt 3540 
tttccttggt gaaaatcatc atccctgcgt tgtcaccact tgacttcatg ttcttttgtt 3600
acattttggc agtatataaa tcctattgtc aggaactcaa agcatccttt gaaaggcgat 3660 
cttctatatg ctattgaaag agtgttgaag ctttcagttc caaatttata tgtgtggctc 3720 
tgcatgttct actgcttctt ccacctttgg tatgctgtga tcccatctct ttcaaaataa 3780 
tttgcaaatt cgaaaaaccg aaaaaggcta aatctcatac gaatttgata tttttagttt 3840
cttagagtcg gtgatgtaat ttcagttact gaacgcaaat ctcttgtcca aaggttaaac 3900 
atattggcag agcttctctg cttcggggat cgtgaattct acaaagattg gtggaatgca 3960
aaaagtgtgg gagatgtgag ctattttact caaaagaaaa cttatgattt ttaatgttgt 4020 
cgttgttttt gggtcatcta actaaccaaa ttcatgtatt cactgtcttc ctttatcagt 4080 
actggagaat gtggaatatg gtatggttct cttcctaaac atcaccttct tttgtacaca 4140 
aaatagaaga agagagctaa ttaagatctt gttttccttg acagcctgtt cataaatgga 4200
tggttcgaca tatatacttc ccgtgcttgc gcagcaagat accaaaggtg agtgagatat 4260 
ataccgatat gcaattgtcg agatttgttt ctgtgatata aatttaaccc tccacacact 4320 
tgtttttcag acactcgcca ttatcattgc tttcctagtc tctgcagtct ttcatgaggt 4380 
atacatactt tctacattgc cctgtctcta gacgcatgaa cacacgctag tgaaagaaat 4440 
gctaatattc aaagcattgt ttttacttaa cgatcttgtg ttacaaattt ccttttgaca 4500 
gctatgcatc gcagttcctt gtcgtctctt caagctatgg gcttttcttg ggattatgtt 4560 
tcaggttaaa aaattactaa actgctgcag tcgattttta ctaaactcta atctcatatt 4620 
ctgaccaacc aatttgtttg agtaggtgcc tttggtcttc atcacaaact atctacagga 4680 
aaggtttggc tcaacggtat gctctcaaaa cccgagaaaa tagaacgaat aactctttct 4740 
ttcatagcct agccatttaa atcgcaatgc tgaaacttaa taataaaggt gatctgtttt 4800 
ggaatgggat catattatta ggtggggaac atgatcttct ggttcatctt ctgcattttc 4860 
ggacaaccga tgtgtgtgct tctttattac cacgacctga tgaaccgaaa aggatcgatg 4920
tcatgaaaca actgttcaaa aaatgacttt cttcaaacat ctatggcctc gttggatctc 4980 
cgttgatgtt gtggtggttc tgatgctaaa acgacaaata gtgttataac cattgaagaa 5040
gaaaagaaaa ttagagttgt tgtatctgca aaaattttgg tagagacacg cgaacccgtt 5100 
tggattttgt tatggtgtaa agaaatttca atcaaaaaac tgttgtaata attgttacca 5160 
aaaagaaatg cttttctgga aacgagggga aaaatagtag ttttgttagg ttttactgtt 5220
tggaccaaat ctagtaaaaa actttttgta ataaggaaaa aaaaagaaca aatgtgataa 5280 
atgcatgggg attgtatgaa accttccaat aaagttgatt ggtggtcccg ttttgggga 5339



<210> 25 
<211> 547 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 25 

Met Ala Ile Leu Asp Ser Ala Gly Val Thr Thr Val Thr Glu Asn Gly
1 5 10 15

Gly Gly Glu Phe Val Asp Leu Asp Arg Leu Arg Arg Arg Lys Ser Arg 
20 25 30 

Ser Asp Ser Ser Asn Gly Leu Leu Leu Ser Gly Ser Asp Asn Asn Ser 
35 40 45 

Pro Ser Asp Asp Val Gly Ala Pro Ala Asp Val Arg Asp Arg Ile Asp
50 55 60 

Ser Val Val Asn Asp Asp Ala Gln Gly Thr Ala Asn Leu Ala Gly Asp
65 70 75 80 

Asn Asn Gly Gly Gly Asp Asn Asn Gly Gly Gly Arg Gly Gly Gly Glu 
85 90 95 

Gly Arg Gly Asn Ala Asp Ala Thr Phe Thr Tyr Arg Pro Ser Val Pro 
100 105 110 

Ala His Arg Arg Ala Arg Glu Ser Pro Leu Ser Ser Asp Ala Ile Phe
115 120 125 

Lys Gln Ser His Ala Gly Leu Phe Asn Leu Cys Val Val Val Leu Ile
130 135 140 



Ala Val Asn Ser Arg Leu Ile Ile Glu Asn Leu Met Lys Ser His Ala
145 150 155 160 
Gly Leu Phe Asn Leu Cys Val Val Val Leu Ile Ala Val Asn Ser Arg
165 170 175 

Leu Ile Ile Glu Asn Leu Met Lys Tyr Gly Trp Leu Ile Arg Thr Asp
180 185 190 

Phe Trp Phe Ser Ser Arg Ser Leu Arg Asp Trp Pro Leu Phe Met Cys 
195 200 205 

Cys Ile Ser Leu Ser Ile Phe Pro Leu Ala Ala Phe Thr Val Glu Lys
210 215 220 

Leu Val Leu Gln Lys Tyr Ile Ser Glu Pro Val Val Ile Phe Leu His
225 230 235 240 

Ile Ile Ile Thr Met Thr Glu Val Leu Tyr Pro Val Tyr Val Thr Leu
245 250 255

Arg Cys Asp Ser Ala Phe Leu Ser Gly Val Thr Leu Met Leu Leu Thr 
260 265 270 

Cys Ile Val Trp Leu Lys Leu Val Ser Tyr Ala His Thr Ser Tyr Asp
275 280 285 

Ile Arg Ser Leu Ala Asn Ala Ala Asp Lys Ala Asn Pro Glu Val Ser
290 295 300 

Tyr Tyr Val Ser Leu Lys Ser Leu Ala Tyr Phe Met Val Ala Pro Thr 
305 310 315 320 

Leu Cys Tyr Gln Pro Ser Tyr Pro Arg Ser Ala Cys Ile Arg Lys Gly
325 330 335 

Trp Val Ala Arg Gln Phe Ala Lys Leu Val Ile Phe Thr Gly Phe Met
340 345 350 

Gly Phe Ile Ile Glu Gln Tyr Ile Asn Pro Ile Val Arg Asn Ser Lys
355 360 365 

His Pro Leu Lys Gly Asp Leu Leu Tyr Ala Ile Glu Arg Val Leu Lys
370 375 380 

Leu Ser Val Pro Asn Leu Tyr Val Trp Leu Cys Met Phe Tyr Cys Phe 
385 390 395 400 

Phe His Leu Trp Leu Asn Ile Leu Ala Glu Leu Leu Cys Phe Gly Asp
405 410 415 

Arg Glu Phe Tyr Lys Asp Trp Trp Asn Ala Lys Ser Val Gly Asp Tyr 
420 425 430 

Trp Arg Met Trp Asn Met Pro Val His Lys Trp Met Val Arg His Ile
435 440 445 

Tyr Phe Pro Cys Leu Arg Ser Lys Ile Pro Lys Thr Leu Ala Ile Ile
450 455 460 
Ile Ala Phe Leu Val Ser Ala Val Phe His Glu Leu Cys Ile Ala Val
465 470 475 480

Pro Cys Arg Leu Phe Lys Leu Trp Ala Phe Leu Gly Ile Met Phe Gln
485 490 495

Val Pro Leu Val Phe Ile Thr Asn Tyr Leu Gln Glu Arg Phe Gly Ser
500 505 510

Thr Val Gly Asn Met Ile Phe Trp Phe Ile Phe Cys Ile Phe Gly Gln
515 520 525

Pro Met Cys Val Leu Leu Tyr Tyr His Asp Leu Met Asn Arg Lys Gly
530 535 540

Ser Met Ser
545



